Developmental tyrosine kinases and their ligands

ABSTRACT

The invention relates to mammalian receptor tyrosine kinases designated developmental tyrosine kinases (Dtks). Dtks are expressed in multipotential haematopoietic cells, in embryonic stem cells, in brain tissue and in testis, but are not expressed mature lineage-restricted haematopoetic cells.  
     The invention provides full-sequence Dtks as well as extracellular receptor domains of such Dtks. The invention further provides nucleic acid molecules encoding such Dtks, vectors containing DNA encoding such Dtks, ligands which bind to such Dtks, and methods of therapeutic and/or prophylactic treatment employing either the ligands or extracellular receptor domains.

FIELD OF THE INVENTION

[0001] The present invention generally relates to protein tyrosine kinase receptors widely expressed by early cells of the haematopoietic system, by cells of the neuronal system in brain tissue, and in testis, ligands for such receptors and nucleic acid molecules encoding such receptors.

BACKGROUND OF THE INVENTION

[0002] There are several parallels between the development of the haematopoietic and neuronal systems. In particular, the presence of regulatory protein Molecules termed growth factors which recognise and bind to specific cell membrane receptors is a common feature of these two systems. It is possible that shared families of receptors exist that are expressed in both early haematopoietic and neuronal stem cells. In turn, there may be a family of proteins which bind these receptors and function as stem cell growth factors.

[0003] The current view of vertebrate haematopoietic ontogeny holds that a succession of pluripotential stem cell migrations originate in the yolk sac blood islands, initially invade the hepatic rudiment, and then the spleen and bone marrow. From the bone marrow, a limited number of multipotential stem cells are laid down during embryogenesis that give rise to a much larger population of developmentally restricted progenitor cells, and ultimately produce the mature cells of at least eight cell lineages. The cells of these lineages are classified as red and white blood cells. The white blood cells contain the mature cells of the lymphoid and myeloid systems. Lymphoid cells contain T and B lymphocytes and are derived from pre-T and pre-B cells, respectively. The myeloid system comprises several cell types known as granulocytes, platelets, monocytes, macrophages, and megakaryocytes. The granulocytes are further divided into neutrophils, eosinophils, basophils and mast cells (see review by Metcalf D. The Molecular Control of Blood Cells , Harvard Univ Press, 1988).

[0004] The haematopoietic system functions by precisely controlling the production of cells in the various lineages. Totipotent haematopoietic stem cells have the ability to both self-renew and differentiate. Stem cells undergo a series of differentiation steps leading to increasingly lineage-restricted progenitor cells. The more mature progenitor cells are restricted to production of only one or two lineages. For some time the colony-forming unit-spleen (CFU-S) assay served to operationally define all stem cells. Recent evidence demonstrates heterogeneity within CFU-S, with only a small fraction of CFU-S capable of contributing to long-term repopulation following ablation of the haematopoietic system by irradiation. It is recognised that stem and progenitor populations are not discrete, but represent a continuum of cells from those of high self-renewal capacity and low probability of differentiation to those cells with low self-renewal probability and high commitment to differentiation. When long-term haematopoiesis is investigated at the clonal level, studies have shown that single stem cell clones are sufficient to maintain haematopoiesis over the lifetime of an animal.

[0005] The development of the mammalian embryo is governed by interactions between different embryonic cell populations. This process is manifest at the cellular level in the precise temporal and spatial control of proliferation, differentiation and migration. The coordination of these processes may be achieved in part by the action of a family of regulatory molecules termed growth factors. Growth factors can evoke diverse responses in different cell types and may interact with one another synergistically or antagonistically. Their action is complex and most of our current understanding results from in vitro experiments. In most instances, haematopoietic growth factor actions defined in vitro have been confirmed in vivo. In haematopoiesis, some growth factors are lineage-restricted in their action. These include erythropoietin that acts predominantly on red cell development, and granulocyte colony-stimulating factor that's predominant action is on granulocytes. At the other end of the spectrum is interleukin-3 which can act on several target cells such as granulocyte-macrophage progenitors, eosinophils, megakaryocytes, erythroid cells and mast cells. There are no known growth factors that function exclusively on haematopoietic stem cells.

[0006] The ligand for c-kit, termed stem cell factor, kit ligand or mast cell growth factor is the product of the Steel (SI) locus in mice. The factor acts either alone or synergistically with several known growth factors on primitive stem cells. It is believed that this factor is essential for the development of early haematopoietic stem cells, and cells of the erythroid and mast cell lineages.

[0007] The stem cell compartment may be viewed as a finely tuned balance between the action of inhibitors and the stimulatory role of cytokines. As with other stem cell systems, haematopoietic stem cells are distributed in a defined spatial manner within adult bones and not in a random, homogeneous mixture of interacting cell types. A concept that underlies the regulation of haematopoietic stem cell development is that these cells reside within a specialised microenvironment, where the regulatory signals act locally. Stromal cells constitute the bone marrow microenvironment.

[0008] Embryonic stem cells are permanent cell lines established directly from the inner cell mass of the preimplantation mouse embryo. They retain the ability to participate in normal embryonic development and, following introduction into the blastocyst, generate chimaeric animals that are mosaic in all tissues. Embryonic stem cells are increasingly being used as cellular vectors for experimentally manipulating the mouse genome.

[0009] Doetschman has demonstrated that embryonic stem cells can generate primitive erythroid cells in culture (Doetschman et. al. J. Embryol. Exa. Morphol. 87, 27-45, (1985)). This result was achieved by inducing embryonic stem cells to form cystic embryoid bodies in the presence of preselected batches of human cord serum.

[0010] In addition to haematopoietic cell development, it has been noted that neurons also arise in differentiating embryonic stem cells. Haematopoietic differentiation in this system occurred infrequently, slowly and was not synchronized. Recently a modified system enabling the differentiation of embryonic stem cells in methylcellulose into multiple haematopoietic lineages has been described by Wiles and Keller (Development 111, 259-267, (1991)). Using this approach, macrophages, neutrophils, erythroid cells and mast cells develop in a synchronous manner with high frequency in the absence of human cord serum. The development of haematopoiesis from embryonic stem cells in methylcellulose cultures parallels the onset of haematopoiesis in the developing mouse embryo.

[0011] An important objective in the field of developmental biology is the identification of genes, the products of which mediate regulatory signals required during embryogenesis. There is compelling evidence that genes encoding receptor tyrosine kinases (RTKs) are involved in early development in vertebrates. The general family of protein tyrosine kinases can be recognised by the presence of several conserved amino acid regions in the catalytic domain. These conserved regions have been summarised by Hanks et al (Science 241, 42-52, (1988)) and by Wilks et al (Proc. Natl. Acad. Sci. USA 86, 1603-1607, (1989)). The receptor for macrophage colony-stimulating factor c-fms, which is important in myeloid cell differentiation and placental development is an RTK. The mouse developmental mutation W has been shown to involve an RTK. The W locus encodes the c-kit RTK and affects the proliferative and/or migratory properties of primordial germ cells, melanoblasts and haematopoietic stem cells. A recently described RTK termed flk-2, which is related to c-kit, has been isolated using the polymerase chain reaction (PCR) with oligonucleotides to conserved kinase domain motifs. Messenger RNA transcripts for flk-2 are expressed in populations enriched for stem cells and primitive uncommitted progenitor cells, and are absent in mature haematopoietic cells (see Matthews et al. Cell 65, 1143-1152, (1991)).

[0012] Additional receptor tyrosine kinases expressed on pluripotential haematopoietic stem cells are needed to facilitate the in vitro growth of stem cells. The nucleic acid molecules that encode receptor tyrosine kinases expressed by pluripotential stem cells are needed to produce recombinant receptors and ligands.

[0013] In vertebrate development, the cells whose descendants give rise to the nervous system are first identified as the neural ectoderm. This forms a tube-like structure beneath the surface of the ectoderm. Following closure of the neural tube some precursor cells detach from the apical neural tube and form a transient structure called the neural crest. These cells rapidly disperse into the embryo along complex migratory pathways. The proliferating neural crest cells also invade developing tissues such as the skin, gut, and the adrenal gland to form differentiated cell populations within these tissues; eg. melanocytes, enteric neurons and adrenal medullary chromaffin cells.

[0014] The diversity of cell types derived from the neural crest poses the problem of how uncommitted embryonic cells acquire particular developmental fates. There are strong parallels between neural crest cell lineage diversification and the process of haematopoiesis. It has been proposed that the earliest neural crest cells should be multipotent and maybe capable of self renewal. Secondly, it should be possible to identify committed progenitors that proliferate symmetrically and are restricted to distinct sublineages and thirdly, there should exist factors which influence the proliferation and/or differentiation of specific types of progenitors (see Anderson Neuron 3, 1-12, (1989)).

[0015] Soluble proteins variously termed neurotrophic, growth, and neuronal differentiation factors have been identified that influence the developmental growth, maintenance of function, and plasticity of neuronal populations. These factors have been implicated in the proliferation and differentiation of neurons during embryonic development and in their growth and survival in the adult nervous system. There are a growing number of neurotrophic factors, including nerve growth factor, brain-derived neurotrophic factor, neurotrophin-3 and neurotrophin-4. These molecules constitute a closely related family sharing at least 60% amino acid identity. If the parallel to the haematopoietic system is extended, the range and complexity of cells derived from the neural crest implies that there will be a large number of protein regulators which control this system.

[0016] Two different types of receptors have been demonstrated for neurotrophins. One group of these receptors are transmembrane glycoproteins with tyrosine kinase activity encoded by members of the trk protooncogene family. It would therefore be important to isolate additional receptor tyrosine kinases from developing systems such as embryonic stem cells which contain neurons. Ligands for such receptors are required to act inter alia as neurotrophic factors. Nucleic acid molecules encoding the receptors and ligands are needed to produce recombinant receptors and ligands.

[0017] It is the object of the present invention to go some way towards fulfilling the above objectives or at least to provide the public with a useful choice.

SUMMARY OF THE INVENTION

[0018] The present invention has a number of aspects. In a first aspect, the invention provides a mammalian receptor tyrosine kinase which is a developmental tyrosine kinase (Dtk) and which is expressed in multipotential haematopoietic cells, in embryonic stem cells, in brain tissue and in testis, but which is not expressed in mature lineage-restricted haematopoietic cells.

[0019] In a further aspect, the invention provides an extracellular receptor domain of a receptor tyrosine kinase as defined above. In preferred embodiments, this extracellular receptor domain can be bound to a support, or can be in a soluble form.

[0020] In still a further aspect, the invention provides a nucleic acid molecule encoding a receptor tyrosine kinase or extracellular receptor domain as defined above. This nucleic acid molecule is preferably DNA.

[0021] In yet a further aspect, the invention provides a vector including a DNA molecule as defined above.

[0022] In still a further aspect, the invention provides a method of producing a receptor tyrosine kinase comprising the steps of:

[0023] (a) culturing a host cell which has been transformed or transfected with a vector as defined above to express the encoded receptor tyrosine kinase or extracellular receptor domain; and

[0024] (b) recovering the expressed receptor tyrosine kinase.

[0025] As yet an additional aspect, the invention provides a ligand that binds to a receptor tyrosine kinase as defined above.

[0026] The ligand can take two forms. In one form, the ligand stimulates the proliferation, differentiation and/or survival of cells which express a receptor tyrosine kinase as defined above (a stimulant ligand).

[0027] In the second form, the ligand is antagonistic and at least partially blocks or inhibits the function of a receptor tyrosine kinase as defined above through binding to said receptor (an antagonistic ligand).

[0028] In another aspect, the invention provides a method of stimulating the proliferation, differentiation and/or survival of a cell expressing a receptor tyrosine kinase as defined above comprising contacting the cell with a stimulant ligand as defined above.

[0029] In yet a further aspect, the invention provides a method of inhibiting the function of a receptor tyrosine kinase as defined above comprising contacting the receptor with an antagonistic ligand as defined above.

[0030] In still another aspect, the invention provides a method of treating a disease, syndrome or condition caused or mediated by an excess of a ligand as defined above comprising the step of contacting said excess of said ligand with an effective amount of a receptor tyrosine kinase or an extracellular receptor domain as defined above.

[0031] In another aspect, the invention provides a method of extracting a ligand from a medium which may contain said ligand comprising the step of contacting said medium with a receptor tyrosine kinase or with an extracellular receptor domain as defined above.

[0032] The invention also provides a method of isolating ligand(s) from a medium which may contain said ligand(s), comprising the steps of:

[0033] (a) contacting said medium with an effective amount of a receptor tyrosine kinase or an extracellular domain as defined above;

[0034] (b) detecting which ligand(s) bind; and

[0035] (c) isolating such bound ligand(s).

[0036] While the invention broadly consists in the foregoing, it should be appreciated that it also includes the more specific embodiments detailed in the following description:

DESCRIPTION OF THE FIGURES

[0037]FIG. 1 shows expression of murine Dtk in embryonic stem (ES) cells and embryoid bodies. RNase protection analysis was performed on total RNA (10 μg) from ESD3 ES cells growing in Leukemia Inhibitory Factor (LIF) (day 0), or from ES cells maintained in the absence of LIF that were differentiating and developing into cystic embryoid bodies (days 2 to 18). As a control tRNA (10 μg) was also used. The markers were pBR322 digested with Msp I. The size of the free murine Dtk probe was 229 nt. A fully protected fragment representing the presence of murine Dtk transcripts was 187 nt in length. The free B-actin protected fragment is shown in each lane as an RNA loading control.

[0038]FIG. 2 shows expression of murine Dtk in embryonic mouse tissues. RNase protection analysis-was performed on total RNA (10 μg) isolated from E14.5 embryonic tissues of the C57BL/6J mouse strain. Details of the markers, probes and controls are as described for FIG. 1.

[0039]FIGS. 3 and 4 show expression of murine Dtk in adult mouse tissues. RNase protection analysis was performed on total RNA (10 μg) isolated from the various tissues of adult C57BL/6J mice. Details of the markers, probes and controls are as described for FIG. 1.

[0040]FIG. 5 shows expression of murine Dtk in murine cell lines. The most abundant expression is in the multipotential cell lines FDC-P1 and DA2, and the mast cell line P815. The majority of other cell lines art lineage-committed, mature haematopoietic cell lines, which have very limited murine Dtk expression. The NIH 3T3 cell line is derived from embryonic fibroblasts and C2C12 is a myoblast cell line.

[0041]FIG. 6 shows the cDNA and amino acid sequence of murine Dtk.

[0042]FIG. 7 shows the cDNA and amino acid sequence of human Dtk.

DETAILED DESCRIPTION OF THE INVENTION

[0043] A. Receptors

[0044] In a first aspect, this invention provides a mammalian receptor protein tyrosine kinase (PTK). The mammal in which the PTK exists may be any mammal, such as a mouse, rat, rabbit or human.

[0045] Members of the PTK family are recognised by the conserved amino acid regions in the catalytic domains. Examples of PTK consensus sequences have been provided by Hanks et al. (Science 241 42-52 (1988), especially FIG. 1 starting at page 46) and by Wilks et al. (Proc. Natl. Acad. Sci. USA 86, 1603-1607 (1989), especially FIG. 2 on page 1605).

[0046] Hanks et al. identify eleven catalytic subdomains containing FIK consensus residues and sequences. The PTKs of the present invention contain most or all of these consensus residues and sequences.

[0047] As indicated above, the PTKs of the invention are receptor PTKs and so are also generally referred to as RTKs. Further, as the applicants believe that the RTKs of the invention are involved in mammalian cell development, they are specifically referred to hereinafter as developmental tyrosine kinases (Dtks).

[0048] The Dtks of the invention are transmembrane receptor tyrosine kinases whose extracellular domains contain two immunoglobulin-like motifs followed by two fibronectin-type III repeats. RTKs of this structure (Axl(Ufo,Ark)) are already known (Janssen et al., Oncozene 6, 2113-2120 (1991); O'Bryan et al., Mol. Cell. Biol 11, 5016-5031 (1991); Rescigno et al. Oncogene 6, 1909-1913 (1991); Faust et al. Oncogene 7, 1287-1293 (1992)). The Dtks of the invention are however distinguished from those RTKs having the equivalently structured extracellular domains by their potential function based upon their distribution within the mammalian body.

[0049] With regard to this latter feature of the Dtks of the invention, the applicants have conducted experiments to determine the range of cells in which the developmental tyrosine kinases of the invention are expressed. These experiments were specifically performed in relation to murine Dtk but are believed to be illustrative of the expression of all mammalian Dtks of the invention.

[0050] A.1 Analysis of Murine Dtk Expression

[0051] The expression of murine Dtk in a range of embryonic and adult mouse tissues was analyzed by ribonuclease protection analysis, using a probe that encompassed sequences encoding the membrane-proximal portion of the extracellular domain of the receptor.

[0052] Materials and Methods

[0053] 1. Embryonic Stem Cell Culture

[0054] The ESD3 embryonic cell line (Doestschman et al., J. Embryol. Exp. Morphol. 87 27-45 (1985)) was maintained on gelatin-coated dishes in Dulbecco's-modified Eagle's medium (DMEM) with additives according to established procedures (Hogan et al., Cold Spring Harbour Laboratory, 1-332 (1986)), in the presence of LIF. Cystic embryoid bodies were established following collagenase treatment of the ES cells and subsequent suspension culture in bacteriological-grade petri dishes in DMEM with additives in the absence of LIF (Wiles and Keller, Development 111, 259-265 (1991)).

[0055] 2. Fetal Liver Haematopoietic Stem Cell Enrichment

[0056] Low density haematopoietic stem cells were isolated from an E14.5 fetal liver cell suspension using equilibrium density centrifugation on a discontinuous metrizamide gradient according to the method of Visser et al., J. Exp. Med, 59, 1576-1590 (1984). Following this procedure, low density fetal liver cells (p24<1.078 g/cm³) were incubated for 20 minutes on ice in DMEM medium with 5 μg/10⁶ cells of AA4 monoclonal antibody (rat IgG_(2b); McKeam et al., Proc. Natl. Acad. Sci. USA, 82, 7414-7418 (1985)) and washed twice. This antibody has been shown to recognise the most primitive haematopoietic stem cell in fetal liver (Jordan et al., Cell, 61, 953-963 (1990)). The AA4 labelled cells were then incubated on ice for 20 minutes with magnetic beads conjugated with anti-rat IgG antibody as outlined in the manufacturer's protocol (Advanced Magnetics Corp., Cambridge, Mass.). Following incubation, AA4⁺ cells were positively-selected on a magnet. Stem cell enrichment was assessed by re-labelling the cells with the AA4 antibody, followed by a second layer antibody staining with goat anti-rat fluorescein isothiocyanate and flow cytometric analysis on a FACS 440 (Becton Dickinson, San Jose, Calif.).

[0057] 3. RNA Analysis

[0058] RNase protection analysis was performed by hybridization of 10 μg of total RNA to RNA probes that encoded sequences of murine Dtk and B-actin, overnight at 52° C. RNase digestion was performed with RNase T1 (1.75 μ/ml) and RNase A (35 μg/ml) at 37° C. for one hour. The reaction was stopped with proteinase K (333 μg/ml) and SDS (0.3%). The products were run on a 6% urea/acrylamide gel and the autoradiograph exposed at −70° C. The probe for analysis of Dtk expression was derived from nucleotides 1158 to 1334 of the Dtk sequence, a segment which encodes the membrane-proximal portion of the extracellular domain, and which had been subcloned into pGEM-4Z. In an RNase protection assay, the free probe yielded a 229 nucleotide (nt) band, and Dtk transcripts protected a fragment of 187 nt. A riboprobe was also constructed from a Sal I-Sma I fragment of human β-actin. The length of the free β-actin probe was 132 nt and β-actin transcripts protected a 54 nt fragment.

[0059] Results

[0060] 1. Embryonic Stem Cells

[0061]FIG. 1 demonstrates the expression of Dtk transcripts in both totipotent ES cells growing in LIF (termed day 0), and in differentiating cystic embryoid bodies growing in the absence of LIF for up to 18 days. In this developmental system Dtk is expressed almost uniformly from days 0 to 18, indicated by the presence of a protected 187 nt band for each RNA analyzed. The two bands of approximately 220 nt and 210 nt present in lanes for each RNA sample analyzed are also present in the tRNA lane and are regarded as nonspecific. Of considerable interest with regard to the importance of this receptor in mouse development is the demonstration of Dtk expression in totipotent ES cells. The ES cells from which RNA was extracted for day 0 analysis were selected from cultures, following morphological assessment by phase-contrast microscopy to confirm that they were undifferentiated.

[0062] 2. Embryonic Tissues

[0063] Expression of Dtk was detected in total RNA isolated from a wide range of midgestational E14.5 embryonic mouse tissues including the brain, eye, thymus, lung, intestine, forelimb, hindlimb and testis (FIG. 2). There was limited expression in heart and unfractionated liver.

[0064]FIG. 2 shows enrichment of Dtk transcripts in E14.5 fetal liver low density AA4⁺ haematopoietic stem cells. Following density-gradient centrifugation and positive selection, the cells used for RNA analysis were greater than 95% AA4⁺, as assessed by flow cytometry (data not shown). Dtk expression was also detected in day 14.5 placenta.

[0065] 3. Adult Tissues

[0066] In contrast to the widespread expression of Dtk in embryonic tissues, the pattern of expression in adult tissues becomes restricted (FIGS. 3 and 4). Dtk transcripts were most abundant in brain, esophagus, bladder, testis, and ovary. In brain, expression of Dtk (relative to β-actin) was more abundant in adult than in embryonic tissue. Adult tissues which contained less abundant, but detectable transcripts were lung, and regions of the gastrointestinal tract including the stomach and both the small and large intestine. Tissues in which Dtk transcripts were undetectable or expressed at extremely low levels included the salivary gland, thymus, heart, liver, skeletal muscle, kidney, spleen, bone marrow, adrenal gland and uterus.

[0067] 4. Murine Cell Lines

[0068] The pattern of expression of Dtk in murine cell lines was analyzed in relation to the following: WEHI-3B, 416B, ELM, SO3, SP2/0, P388D₁, P815, FDC-P1, DA2, FDC-P1/IL-2 ras, NIH3T3 and C2C12. The results are shown in FIG. 5.

[0069] As can be seen from FIG. 5, the results are consistent with those above, with the most abundant expression being in the multipotential cell lines FDC-P1 and DA2, and in mast cell line P815. Significant expression is also observed in myoblast cell line C2C12.

[0070] In contrast, the remaining cell lines (lineage-restricted mature haematopoietic cell lines) show very limited murine Dtk expression.

[0071] From this analysis, the applicants have derived the condition defining the Dtks of the invention—they are expressed in multipotential haematopoietic cells, in totipotent embryonic stem cells, in brain tissue and in testis, but not in mature lineage-restricted haematopoietic cells.

[0072] For the purpose of this specification, a multipotential haematopoietic cell is an early haematopoietic cell. Examples of multipotential haematopoietic cells include multipotential factor-dependent cells that have the capacity to proliferate and differentiate into mature haematopoietic cells. In contrast, a mature haematopoietic cell is non self-renewing and has limited ability to give rise to multiple cell lineages. Mature lineage-restricted haematopoietic cells, for the purposes of this specification, are therefore represented by haematopoietic cell lines of the T or B lymphoid lineage or mature myeloid lineages.

[0073] The Dtks of the present invention may or may not be expressed in intermediate cells poised between the state of being multipotential and mature.

[0074] In terms of brain tissue, the Dtks of the invention are primarily expressed in neuronal cells.

[0075] In terms of testis, the Dtks are primarily expressed in the Sertoli cells.

[0076] It will of course be appreciated by those persons skilled in this art that the reference to the Dtks of the invention not being expressed in mature lineage-restricted haematopoietic cells is in a biological context and does not mean that there is absolutely no expression of the Dtk in these cells. As is apparent from FIGS. 1 to 5, what is meant by the phrase “not expressed in mature-lineage restricted haematopoietic cells” is that there is no significant expression of the Dtk in the cell, i.e. that expression is either undetectable or at an extremely low level.

[0077] The restricted expression of the Dtks of the invention to cells representative of early multipotential cells, with substantial absence of expression in lineage-restricted cells such as T or B lymphocytes, is consistent with this receptor functioning and transducing signals from the microenvironment to the haematopoietic stem cell compartment. The expression of the Dtk in embryonic stem cells and in some fetal tissues such as brain is also consistent with this receptor and its ligand having a functional role in the specification of cell lineages during embryonic development, including neuronal development. Furthermore, the receptor and its ligand is likely to have a role in the maintenance of function and plasticity in neuronal populations or their derivatives. Finally, the expression of the receptor in adult brain is consistent with the receptor and its ligand having a role in the growth and survival of neurons in the adult nervous system.

[0078] The embryonic stem cell and haematopoietic multipotential cell line nRNA for Dtk migrates relative to 28S and 18S ribosomal bands on formaldehyde agarose gels at approximately 4.2 Kb. In adult brain tissues, Dtk mRNA migrates at approximately 4.2 Kb.

[0079] The Dtks of the invention can usefully be provided in a number of different forms. These include the Dtk itself, the “mature” form of the Dtk, and the extracellular receptor domain of the Dtk.

[0080] The “mature” form of the Dtk of the invention is the Dtk less its native amino-terminus leader or signal sequence, whereas the extracellular receptor domain is the Dtk lacking the transmembrane region and catalytic domain.

[0081] The extracellular domain may be identified through commonly recognised criteria of extracellular amino acid sequences. The determination of appropriate criteria is known to those skilled in the art, and has been described, for example by Hopp et al., Proc. Natl. Acad. Sci. USA 78, 3824-3828 (1991); Kyte et al., J. Mol. Biol. 157, 105-132 (1982); Emini, J. Virol 55, 836-839 (1985); Jameson et al. CA BIOS 4 181-186 (1988); and Karplus et al. Naturwissenschaften 72, 212-213 (1985). Amino acid domains predicted by these criteria to be surface exposed are characteristic of extracellular domains.

[0082] The Dtks of the invention or their extracellular receptor domains may be prepared by methods known in the art. Such methods include protein synthesis from individual amino acids as described by Stuart and Young in “Solid Phase Peptide Synthesis”, Second Edition, Pierce Chemical Company (1984). It is however preferred that the Dtks and/or their extracellular receptor domains be prepared by recombinant methods as will be detailed hereinafter.

[0083] A.2 Specific Dtks of the Invention

[0084] A.2.1 Murine Dtk

[0085] As is indicated above, a first Dtk of the invention, murine Dtk, has been identified in certain tissues of the mouse. Murine Dtk generally has the nucleic acid and deduced amino acid sequence shown in FIG. 6. FIG. 6 represents individual amino acid residues as single letters as follows: Three-letter One-letter Amino Acid abbreviation symbol Alanine Ala A Arginine Arg R Asparagine Asn N Aspartic acid Asp D Asparagine or aspartic acid Asx B Cysteine Cys C Glutamine Gln Q Glutamic Acid Glu E Glutamine or glutamic acid Glx Z Glycine Gly G Histidine His H Isoleucine Ile I Leucine Leu L Lysine Lys K Methionine Met M Phenylalanine Phe F Proline Pro P Serine Ser S Threonine Thr T Tryptophan Trp W Tyrosine Tyr Y Valine Val V

[0086] Details of the sequence of murine Dtk are as follows.

[0087] Sequence Analysis of the Murine Dtk

[0088]FIG. 6 shows the 3.919 Kb nucleotide and deduced amino acid sequence for murine Dtk from murine neonatal brain. Within the 5′ region, a potential site for translation initiation (-GGAGCATGGGG-) is found within a good Kozak consensus sequence. The first methionine initiates an open reading frame of 874 amino acids. Using the method of von Heijne Sequence Analysis in Molecular Biology 113-117, San Diego, Academic Press (1987), the signal cleavage site is predicted to be between alanine 24 and alanine 25, which specifies a 24 amino acid hydrophobic leader sequence and a mature receptor tyrosine kinase protein of 850 amino acids. Amino acids AGLK to PHSR form a 386 amino acid extracellular domain. A 25 amino acid hydrophobic region from TSWV to LILL is consistent with that of a transmembrane domain (Fasman and Gilbert, Trends Biochem 15, 89-92 (1990)), while the remaining amino acids ending HSSC comprise the cytoplasmic domain.

[0089] The extracellular domain of murine Dtk contains eight consensus sites (NxT or S) for N-linked glycosylation, predicting that the mature Dtk protein is glycosylated. Within the extracellular domain, two repeating protein motifs are identifiable. Using the predictive methods of Williams and Barclay, Ann. Rev. Immunol 6, 381405 (1988), two C-type immunoglobulin (Ig)-like domains are present from amino acids KLMG to GEET (Ig-like domain I) and FFTV to NIKG (Ig-like domain II). The first Ig domain has a structure similar to a C1 domain, while the second Ig domain is more C2-like. Based on the analysis of Petersen et ai, Proc. Natl. Acad. Sci. USA 80, 137-141 (1983), there are two fibronectin type III modules present from amino acids PPAA to PYGD (domain I) and from amino acids PFQT to SHDH (domain II).

[0090] Analysis of the 439 amino acid cytoplasmic domain sequence of murine Dtk shows many of the motifs which are highly conserved within the catalytic kinase domain of protein tyrosine kinases (Hanks et al., Science 241, 42-52 (1988)). The motifs GKGEFG and VAVK, which function as the Mg²⁺-ATP binding site (Ullrich and Schlessinger, Cell 61, 203-212 (1990); Cantley et al., Cell 64, 281-302 (1991)), are observed at the start of the kinase domain. Further towards the carboxy-terminus of Dtk other conserved kinase motifs are identifiable, including the motif IHRDLAARN, the DFG triplet motif and the motifs KWLALES and DVWAFG. Alignment of the kinase domain of Dtk with other protein tyrosine kinase domains including that of Ufo, suggests there is a kinase insert region specified by the amino acids RIGENPFN. There are 12 tyrosine residues within the cytoplasmic domain of Dtk, including two residues located near the C-terminus that are nested within sequences that exhibit strong homology to Src homology 2 (SH2) domain bindingasites (Songyang et al., Cell 72, 767-778 (1993)). One of these sequences, EEVYDLM, is a putative binding site for phosphatidylinositol 3-kinase, but lies within the catalytic domain proper and is unlikely to be autophosphorylated. The sequence DPLYINI fulfills criteria for either a Sem5/Grb2 binding site or a phospholipase C-γ binding site (Songyang et al., (1993)) supra, and its position in the C-terminal tail makes it a good candidate for phosphorylation.

[0091] In specific aspects, the invention provides murine Dtk, mature murine Dtk and the extracellular receptor domain of murine Dtk.

[0092] Murine Dtk has the amino acid sequence given as SEQ ID NO 1.

[0093] Mature murine Dtk has the amino acid sequence given as SEQ ID NO 2.

[0094] The extracellular receptor domain of murine Dtk has the amino acid sequence given as SEQ ID NO 5.

[0095] The invention also includes functional equivalents of murine Dtk, mature murine Dtk and the extracellular receptor domain of murine Dtk as is described hereinafter.

[0096] A.2.2 Human Dtk

[0097] A second Dtk of the invention has been identified from human tissue. This second receptor is the human homologue of murine Dtk having all of the structural features of murine Dtk.

[0098] The nucleic acid and deduced amino acid sequence for this receptor tyrosine kinase, hereinafter called “human Dtk”, is shown in FIG. 7. Sequence details are as follows.

[0099] Sequence Analysis of the Human Dtk

[0100]FIG. 7 shows the 4.364 Kb nucleotide and deduced amino acid sequence for the human Dtk from human fetal brain. The structural features of human Dtk closely parallel those described for murine Dtk. The signal peptide encompasses amino acids MGRP to ESAA. The mature protein extends from residues AGLK to HSSC. Within the mature protein the extracellular domain is defined by residues AGLK to PHSR, the transmembrane domain by residues TSWV to LILL, and the cytoplasmic domain from residues RKRR to HSSC.

[0101] The extracellular domain contains two repeating protein motifs made up of two immunoglobulin domains (KLMG to GGET and FFTV to NLKG), followed by two fibronectin type III modules (LPAA to PYAD and PFQT to SHDR). The protein tyrosine kinase domain is encompassed by the amino acids LGKG to RMEL within the cytoplasmic domain. The motifs defined within the murine protein tyrosine kinase domain are also identifiable within the human protein tyrosine kinase domain.

[0102] Once again, in its specific aspects the invention provides different forms of the Dtk (human Dtk, “mature” human Dtk and the extracellular receptor domain of human Dtk).

[0103] Human Dtk has the amino acid sequence given as SEQ ID NO 3.

[0104] Mature human Dtk has the amino acid sequence given as SEQ ID NO 4.

[0105] The extracellular receptor domain of human Dtk has the amino acid sequence given as SEQ ID NO 6.

[0106] Once again, the invention further includes functional equivalents of human Dtk, mature human Dtk and of the extracellular receptor domain of human DTK.

[0107] A.2.3 Other Mammalian Dtks

[0108] In addition to the murine and human Dtks described above, the invention includes within its scope Dtks of other mammals. Such Dtks are the homologues of both murine and human Dtk and can be readily identified by those persons skilled in the art with reference to the characterising data given above for murine Dtk and human Dtk.

[0109] By way of example, one method for identifying other Dtks of the invention involves the formation of a DNA library from a suitable tissue source (such as brain) obtained from the mammal. This library can then be screened to identify DNA coding for homologues to murine Dtk and human Dtk as will be described in more detail below.

[0110] B. Nucleic Acid Molecules Encoding the Dtks of the Invention

[0111] In another aspect of this invention, the applicants provide nucleic acid molecules encoding the Dtks. These nucleic acid molecules may be DNA (isolated from nature, synthesised or cDNA) or RNA. Most often, the nucleic acid molecules will be DNA.

[0112] B.1 Nucleic Acid Molecules Encoding Murine Dtk and Human Dtk

[0113] As indicated above, the nucleic acid sequences for murine Dtk and human Dtk have been determined. In specific aspects, the invention therefore provides nucleic acid molecules (in the form of DNA) as follows:

[0114] 1. A DNA molecule encoding murine Dtk having the nucleotide sequence given as SEQ ID NO 7.

[0115] 2. A DNA molecule encoding mature murine Dtk having the nucleotide sequence given as SEQ ID NO 8.

[0116] 3. A DNA molecule encoding the extracellular receptor domain of murine Dtk having the nucleotide sequence given as SEQ ID NO 11.

[0117] 4. A DNA molecule encoding human Dtk having the nucleotide sequence given as SEQ ID NO 9.

[0118] 5. A DNA molecule encoding mature human Dtk having the nucleotide sequence given as SEQ ID NO 10.

[0119] 6. A DNA molecule encoding the extracellular receptor domain of human Dtk having the nucleotide sequence given as SEQ ID NO 12.

[0120] The invention also includes within its scope functional equivalents of these DNA molecules.

[0121] B.2 Nucleic Acid Molecules Encoding Dtks of Other Mammals

[0122] It will be appreciated that DNA molecules encoding the functional equivalent homologues of murine Dtk and human Dtk from other mammals are also within the scope of the invention. Such DNA molecules can be readily identified using conventional techniques and with reference to the information contained herein characterising murine Dtk and human Dtk.

[0123] By way of generic illustration, DNA molecules encoding homologues of murine Dtk and human Dtk in other mammals can be identified by employing the following general steps:

[0124] (a) Formation of a cDNA library:

[0125] Total mRNA from a suitable tissue source (such as brain) of the mammal is prepared by standard procedures (Ausubel et al, (Eds), “Current Protocols in Molecular Biology” Greene Associates/Wiley Interscience, New York (1990)), and cDNA synthesised. A cDNA library is formed (for example in λZAP II).

[0126] (b) Library Screening:

[0127] The cDNA library formed as above is screened for the presence of cDNA encoding homologues to murine Dtk and human Dtk. Screening will generally employ a DNA hybridisation or amplification step with the probes or primers being selected based upon the already determined sequences of murine and human Dtk.

[0128] Most conveniently, the screening procedure will involve DNA amplification using the polymerase chain reaction (PCR) (Saiki et al Science 239, 487 (1988)) with the PCR primers being selected such that highly conserved regions from within the DNA sequence of murine and human Dtk will be within the amplified PCR product.

[0129] (c) DNA Isolation and Sequencing:

[0130] Clones from the cDNA library which are identified by screening step (b) as containing cDNA encoding homologues to murine and human Dtk are selected, and the size of the cDNA insert sourced from the brain determined. Such clone(s) including a cDNA insert of the appropriate size to code for the full-length Dtk are selected and the cDNA insert isolated. Each isolated cDNA insert is then sequenced using known procedures (for example, using the standard dideoxy chain-termination method of Sanger et al., Proc. Natl. Acad. Sci. USA 74, 5463-5467 (1977)).

[0131] B.3 Genetic Mapping of Murine Dtk and Human Dtk

[0132] By way of further characterisation of both murine Dtk and human Dtk, the applicants have performed experiments to establish the chromosomes on which the genes coding for these Dtks are located. Details of these experiments are given below.

[0133] Materials and Methods

[0134] B.3.1 Fluorescent In Situ Hybridization (FISH)

[0135] A partial Sau3A genomic DNA library in λ 2001, prepared from mouse ES cells (Boehm et al., Proc. Natl. Acad. Sci. USA 88, 3927-3931 (1991)), was screened with the 3.525 kb cDNA insert purified from pMo23A using methods previously described (Morris, et al., Blood 76, 1812-1818 (1991)). Of 34 positive clones, two of the most intensely hybridizing, λMo23A-7.1 and λMo23A-8. 1, were selected for FISH studies. The pMo23A plasmid, and DNA isolated from bacteriophage clones λMo23A-7.1 and λMo23A-8.1, were biotinylated by nick-translation using biotin-14-dATP (Bethesda Research Laboratories, Gaithersberg, Md.). Karyotypically normal, 40, XY, mouse metaphase cells were prepared from ES cells in culture using standard procedures. Fluorescent in situ hybridization and detection procedures were essentially as described (Morris et al., Human Genetics 91, 31-36 (1993)), except that mouse Cot 1 DNA (BRL, final concentration 250 ng/μI) was used to suppress repetitive sequences in the two phage DNA probes. Chromosomes were G-banded using DAPI (4′, 6-diamidino-2-phenylindoledihydrochoride, Sigma, St Louis, Mo.) as a counterstain for fluorescence analysis.

[0136] B.3.2 Single-strand Conformation Polymorphism (SSCP)

[0137] Primer sequences from the 3′ untranslated region of the Dtk cDNA used for genetic mapping were as follows: DtkMap1 5′ TGGATGGCAGTAAGGGAGG 3′ (SEQ ID NO: 13) 5′ CTTAAGAGGGGCAAACCTGG 3′ (SEQ ID NO: 14) DtkMap2 5′ GCTTAGAGGAGGTGAGCCAGA 3′ (SEQ ID NO: 15) 5′ TGGGCAGTGCTGAGTTCC 3′ (SEQ ID NO: 16)

[0138] PCR was performed using standard conditions with the addition of ³²P-labelled dCTP. Specifically, 25,μl reactions were performed in 10 mM-Tris-HCI, 50 mM KCl using 250 ng of genomic DNA, 1 μM of each primer, and 1.4 mM MgCl₂. This was overlaid with oil, denatured at 94° C. for 5 minutes, and transferred to an 80° C. heating block. dNTPs were added to a final concentration of 0.2 mM, including 1.25 μCi of [α-³²P]dCTP (1 μl of a 3000 Ci/mmole stock to 8 reactions). 1.25 units of AmpliTaq DNA polymerase (Perkin-Elmer Cetus) was added and cycling conditions were as follows: 58° C. annealing reaction for 1 minute, 72° C. extension reaction for 2 minutes, and 91° C. denaturation for 1 minute. The cycle was repeated 30 times with a final 72° C. extension reaction for 5 minutes. SSCP analysis was performed by electrophoresing the single-stranded PCR products on a non-denaturing gel as follows: 2 μl of the PCR reaction was added to 8 μl of USB stop solution (100% formamide containing xylene cyanol and bromophenol blue).

[0139] This was denatured for 5 minutes at 94° C. and transferred to an ice bucket. 3 μl was loaded on a 5% non-denaturing acrylamide gel containing 0.5×TBE and no glycerol. This was run in a 4° C. cold room in 0.5×TBE at 40 watts constant power for 2-3 hours. The gel was transferred to filter paper, dried, and autoradiographed overnight with an intensifying screen.

[0140] Results

[0141] The chromosomal localisation of the gene encoding murine Dtk has been established on chromosome 2 band F using fluorescent in situ hybridisation. This result has been confirmed using single strand conformation polymorphism analysis in the BXD recombinant inbred series.

[0142] The gene encoding human Dtk has been mapped using fluorescent in situ hybridisation to chromosome 15q15.

[0143] C. Recombinant Expression of Dtks of the Invention

[0144] In yet another aspect, the present invention relates to the recombinant expression of the Dtks or of-their extracellular receptor domains.

[0145] As will be exemplified below, the nucleic acid molecules that encode the receptors or the extracellular receptor domains of the invention may be inserted into known vectors for use in standard recombinant DNA techniques. Standard recombinant DNA techniques are those such as are described in Sambrook et al.; “Molecular Cloning” 2nd Edition Cold Spring Harbour Laboratory Press (1987) and by Ausubel et al., Eds, “Current Protocols in Molecular Biology” Greene Publishing Associates and Wiley-Interscience, New York (1987).

[0146] Vectors for expressing proteins in bacteria, especially E. coli, are known. Such vectors include the PATH vectors described by Dieckmann and Tzagoloff in J. Biol. Chem. 260, 1513-1520 (1985). These vectors contain DNA sequences that encode anthranilate synthetase (TrpE) followed by a polylinker at the carboxy terminus. Other expression vector systems are based on beta-galactosidase (pGEX); lambda P maltose binding protein (pMAL); and gluthathione S-transferase (pGST)—see Gene 67, 31(1988) and Peptide Research 3, 167 (1990).

[0147] Vectors useful in yeast are available and well known. A suitable example is the 2μ plasmid.

[0148] Suitable vectors for use in mammalian cells are also known. Such vectors include well-known derivatives of SV40, adenovirus, retrovirus-derived DNA sequences and vectors derived from combination of plasmids and phage DNA.

[0149] Further eucaryotic expression vectors are known in the art (e.g. P. J. Southern and P. Berg, J. Mol. Appl. Genet. 1, 327-341 (1982); S. Subramani et al, Mol. Cell. Biol. 1, 854-864 (1981); R. J. Kaufmann and P. A. Sharp, “Amplification And Expression of Sequences Cotransfected with a Modular Dihydrofolate Reductase Complementary DNA Gene,” J. Mol. Biol. 159, 601-621 (1982); R. J. Kaufmann and P. A. Sharp, Mol. Cell. Biol. 159, 601-664 (1982); S. I. Scahill et al, “Expression And Characterization Of The Product Of A Human Immune Interferon DNA Gene In Chinese Hamster Ovary Cells,” Proc. Natl. Acad. Sci. USA 80, 46544659 (1983); G. Urlaub and L. A. Chasin, Proc. Natl. Acad. Sci. USA 77, 4216-4220, (1980).

[0150] The expression vectors useful in the present invention contain at least one expression control sequence that is operatively linked to the DNA sequence or fragment to be expressed. The control sequence is inserted in the vector in order to control and to regulate the expression of the cloned DNA sequence. Examples of useful expression control sequences are the lac system, the trp system, the tac system, the trc system, major operator and promoter regions of phage lambda, the control region of fd coat protein, the glycolytic promoters of yeast, e.g. the promoter for 3-phosphoglycerate kinase, the promoters of yeast acid phosphatase, e.g. Pho5, the promoters of the yeast alpha-mating factors, and promoters derived from polyoma, adenovirus, retrovirus, and simian virus, e.g. the early and late promoters or SV40, and other sequences known to control the expression of genes of prokaryotic and eucaryotic cells and their viruses or combinations thereof.

[0151] Vectors containing the receptor-encoding DNA and control signals are inserted into a host cell for expression of the receptor. Some useful expression host cells include well-known prokaryotic and eucaryotic cells. Some suitable prokaryotic hosts include, for example, E. coli, such as E. coli SG-936, E. coli HB 101, E. coli W3110, E. coli X1776, E. coli X2282, E. coli DHT, and E. coli MR01, Pseudomonas, Bacillus, such as Bacillus subtilis, and Streptomyces. Suitable eucaryotic cells include yeast and other fungi, insect, animal cells, such as COS cells and CHO cells, human cells and plant cells in tissue culture.

[0152] A specific although non-limiting example of this aspect of the invention is set out below. It will be appreciated that while the expression of murine Dtk is exemplified, the procedures disclosed are equally applicable to the expression of other Dtks, or to the expression of extracellular receptor domains of such Dtks.

[0153] C.1 Expression of Cloned Murine Dtk in Heterologous Cell Lines

[0154] The coding region of murine Dtk was ligated in-frame into the commercially available expression vector pcDNA3 (In Vitrogen) using standard molecular biology techniques. The pcDNA3-Dtk construct was electroporated into several heterologous cell lines to demonstrate expression of Dtk. Electroporation, drug selection and isolation of Dtk-expressing clones for each cell line followed standard techniques (in M. Kriegler, “Gene Transfer and Expression—A Laboratory Manual”, Stockton Press, New York 1990).

[0155] The Dtk construct was expressed in the factor-dependent cell lines FDC-P1, BAF/3 and 32D, and in the NIH 3T3 cell line (all commercially available). The expression of Dtk in these cell lines has been ascertained at the level of RNA using standard techniques for the isolation of RNA and its detection using radiolabelled Dtk probes which are familiar to those experienced in the field (see Sambrook et al., “Molecular Cloning,” Second Edition, supra vide).

[0156] D. Lipands

[0157] The invention also includes ligands that bind to the Dtks of the invention. The ligand may be a protein such as a growth factor that occurs naturally in a mammal, preferably the same mammal that produces the corresponding Dtk. The growth factor may be isolated and purified, or be present on the surface of an isolated population of cells, such as stromal cells.

[0158] The ligand may also be a molecule that does not occur naturally in a mammal. For example, antibodies raised against the receptors of the invention or against anti-ligand antibodies mimic the shape of, and act as, ligands if they constitute the negative image of the receptor or anti-ligand antibody binding site.

[0159] Such antibodies may be polyclonal but are preferably monoclonal. Monoclonal antibodies may be produced by methods known in the art. These methods include the immunological method described by Kohler and Milstein in Nature 256, 495-497 (1975) and Campbell in “Monoclonal Antibody Technology, the Production and Characterization of Rodent and Human Hybridomas” in Burdon et al. Eds, Laboratory Techniques in Biochemistry and Molecular Biology, Volume 13, Elsevier Science Publishers, Amsterdam (1985); as well as by the recombinant DNA method described by Huse et al. in Science 246, 1275-1281 (1989).

[0160] In yet another form, the ligand may also be a non-protein molecule that acts as a ligand when it binds to, or otherwise comes into contact with, the receptor. In addition, ligands may be of two functional types. The first functional type of ligand is a molecule which binds to the receptor and stimulates it in performing its normal function (a “stimulant ligand”). The second functional type of ligand is a molecule which binds to the receptor and inhibits or prevents it performing its normal function (an “antagonistic ligand”).

[0161] Both types of ligand will find application in either therapeutic or prophylactic treatments as described below.

[0162] D.1 Sources of Ligands

[0163] The strategy for isolating a ligand for the Dtks of the invention is based on the assumption that the ligand will either be a soluble, secreted protein or alternatively it will be membrane-bound or associated.

[0164] To screen for soluble ligands, conditioned media from a range of tumor cell lines and tissues can be used. Such cell lines are readily available from the American Type Culture Collection (ATCC) Rockville, Md., USA. Conditioned media is generated from these cell lines using a variety of culture and induction protocols. The cell lines are grown using standard tissue culture techniques which are detailed by ATCC for each cell line. Conditioned medium from tissues is generated by growing minced tissue fragments in culture medium for a defined time period.

[0165] To screen for membrane-associated ligands a different approach is taken. Cell lines in which from tissues which are in close proximity to those cells or tissues which have been shown to express the Dtk receptor are used. This approach is based on the likelihood of close cell-to-cell contact between receptor-expressing cells and ligand-expressing cells. An example of this is in the testis where Sertoli cells express the receptor, while germ cells are considered a likely source of membrane-bound ligand. A further example in the brain would be where one type of neuron expresses the receptor, while microglial cells or another non-neuronal brain cell are considered likely to express the ligand.

[0166] D.2 Ligand Screening Procedures

[0167] In illustrating the screening procedures, reference will be made to murine Dtk as representative of the Dtks of the invention. Equivalent procedures can of course be employed in screening protocols using other mammalian Dtks such as human Dtk or the extracellular receptor domains of such Dtks.

[0168] Two approaches are followed to screen for the ligand for murine Dtk. If the ligand is soluble, assays which utilise either growth responses or changes in tyrosine phosphorylation will be used. Alternatively, if the ligand is membrane-bound, ligand-expressing cells will be detected using a Dtk-tag protein system whereby the extracellular domain of Dtk is fused with sequence encoding part of the human inmnunoglobulin molecule, such as the Fc region or the μ chain. The tag can then be detected using reagents which bind to the tag, such as Protein A-alkaline phosphatase or Protein A-radioiodine¹²⁵.

[0169] D.3 Soluble Ligand

[0170] To detect soluble ligand in the media conditioned by tumor cell lines or tissues, a range of concentrations of this media are added to one of the factor-dependent cell lines described above, that have been transfected with, and express the Dtk receptor. These cell lines are routinely maintained in interleukin-3 containing tissue culture medium. By withdrawing this medium and adding sources of potential ligand for Dtk, a growth response will be sought that is mediated via the introduced Dtk receptor. This response can be detected using the uptake of radiolabelled thymidine and counting this uptake by liquid scintillation spectroscopy. These techniques are standard for those familiar in the art (see Kriegler (supra); and Crosier et al., Proc. Natl. Acad. Sci. USA 88: 7744-8 (1991)).

[0171] An alternative detection system for ligands contained in tumor cell line conditioned medium uses the Dtk-expressing NIH 3T3 cell line as an indicator system, in conjunction with monitoring alterations in tyrosine phosphorylation of the Dtk receptor. Conditioned medium that contains the ligand for Dtk will trigger activation of the receptor which in turn is reflected in the phosphorylation status of the receptor. The system uses standard techniques whereby the NIH 3T3 cells are incubated with conditioned medium, cell lysates produced which in turn are immunoprecipitated with an anti-murine Dtk polyclonal antibody, proteins are resolved on SDS-PAGE gels, followed by transfer to nitrocellulose filters and subsequent Western blotting with an anti-phosphotyrosine antibody and detection using enhanced chemiluminescence techniques. These techniques are standard protein biochemistry methods (see B. Sefton and T. Hunter (eds), “Methods in Enzymology,” vol 200 and 201, 1990; and Amersham, Manufacturer's protocols for ECL techniques). The expected result with this technique would be that potential ligand-containing media would stimulate increased tyrosine phosphorylation, compared with background levels detected in these cells.

[0172] D.4 Membrane-bound Ligand

[0173] Screening for membrane-bound or associated ligands for the Dtk receptor relies on the use of a Dtk-tag fusion protein detection system. The extracellular domain of the Dtk receptor is fused in-frame to the Fc region of human immunoglobulin (IgG) or to part of the human μ chain of IgM. This procedure follows that described by Goodwin et al., Cell 73: 447456 (1993). The fusion protein is produced by transfecting the fused genes contained within the expression pEDδ c vector into COS cells. The fusion protein is purified on Protein A-Sepharose columns (Pharmacia). The Dtk-tag fusion proteins are biotinylated using sulfosuccinimiddyl-6-biotinamido)-hexanoate (Pierce Chemicals) according to the manufacturer's procedures. Alternatively, FITC-conjugated Dtk-tag fusion protein is generated by conjugating the fusion protein to FITC using standard techniques (see Suda et al., Cell 75: 169-1178, 1993).

[0174] The Dtk-tag fusion protein is used to screen for the expression of bound Dtk protein on tumor cell lines using flow cytometric techniques. The techniques used for the labelling of cells and flow cytometric analysis follow those described by Mosley et al., Cell 59: 335-348 (1989). Tumor cells are labelled on ice with the biotinylated Dtk-fusion protein using avidin-FITC, or the FTTC-labelled protein is used directly in FACS analysis. The screening procedure is aimed at detecting a cell line that produces a signal above background with the Dtk-tag fusion protein, compared with an unrelated receptor-tag fusion protein. Sequential FACS sorting of Dtk ligand-expressing cells is undertaken to generate a high Dtk ligand-expressing tumor cell subline which can be used for the generation of a cDNA expression library (for an overview of this strategy see Wong in Genetic Engineering Vol. 12, ed by J K Setlow, 1990).

[0175] E Expression Cloning of the Dtk Ligand

[0176] E.1 Construction of an Expression Library

[0177] A random-primed expression library is constructed from poly(A)⁺ mRNA isolated from the cell line or tissue demonstrated to give a positive signal in either the growth assay, phosphorylation assay or Dtk-tag fusion protein assay outlined above. The techniques used for construction of the expression library are standard procedures for those experienced in the field (see McMahon et al., EMBO J. 10, 2821-2832, 1991; and Kriegler (supra)).

[0178] E.2 Cloning of the Murine or Human Dtk Ligand

[0179] The expression library constructed from the cell line or tissue is screened by transfecting pools of cDNAs into COS cells using standard techniques (see Sambrook et al., supra). Two approaches are used to detect positive pools, depending on whether there has been evidence for either a soluble form of ligand or a membrane-bound form of ligand.

[0180] Soluble forms: COS supernatants are screened in the detection systems outlined above for soluble ligand forms. COS cells are grown in 10 cm plates using standard tissue culture techniques.

[0181] Membrane-bound forms: COS cells are grown in LabTech (Nunc) chambers and positive pools are detected by using the binding of Dtk-tag fusion protein to the COS cells, followed by detection with either a Protein A-horseradish peroxidase enzymatic reaction or Protein A-¹²⁵I binding and subsequent autoradiography.

[0182] Procedures for the breaking down of cDNA pools, subsequent sib selection and the isolation of single cDNA clones are outlined in Sambrook et al., (supra) and Wong (supra). Sequence analysis of single cDNAs follows standard techniques. Once a single cDNA clone is isolated this is transfected into COS cells or into CHO cells for large scale production of protein using standard procedures.

[0183] F Application of Ligands for the Dtks of the Invention

[0184] The types of ligand discussed above can be employed in two distinctive methods in accordance with this invention.

[0185] The first such method is a method of stimulating the proliferation, differentiation and/or survival of a cell expressing a Dtk of the invention. This stimulation, which can occur in vivo or ex vivo, involves contacting the cell with an effective amount of the ligand.

[0186] The ability of a ligand according to the invention to stimulate cells such as stem cells which express the Dtk of the invention has important therapeutic applications. Such applications include medically treating mammals, including humans, whose stem cells do not sufficiently undergo self-renewal. Examples of such medical problems which can be treated in this way include those that occur when defects in haematopoietic stem cells or their related growth factors depress the number of blood cells, leading, to disorders such as aplastic anaemia. The treatment of bone marrow damage resulting from cancer chemotherapy and radiation is another example of a medical problem that could be treated in this way.

[0187] The method can also be applied in stimulating the proliferation, differentiation and/or survival of mammalian fetal or adult neuronal cells or cells that form part of the central nervous system. Again, this has important therapeutic applications. Such applications include treating mammals, including humans, for inherited or degenerative disorders of the central nervous system. An additional application is the treatment of individuals with central nervous system trauma, for example, spinal cord trauma resulting from either crushing or asphyxia.

[0188] Yet a further therapeutic application for the ligands of the invention is in sports medicine, particularly in the treatment of muscle injuries. The Dtk of the invention is abundantly expressed on myoblast cells but not on mature muscle cells. Application of the ligand will stimulate myoblast cell proliferation and differentiation, leading to muscle repair.

[0189] In terms of ex vivo applications, the method has implications for gene therapy. In gene therapy genes are inserted into host cells (such as haematopoietic stem cells and myoblasts) and the expression of the gene regulated by either an endogenous or an exogenous promoter. However, it is often difficult to maintain growth and survival of these cells ex vivo while they are being manipulated for the insertion of foreign genes. Therefore, as the Dtk of the invention is expressed on haematopoietic stem cells and myoblasts, the ligand has a direct application in stimulating the growth, proliferation or simple survival of their cells during the manipulative process.

[0190] The second distinct method of the invention is a method of inhibiting the function of the Dtk of the invention. This method, which would normally be applied in vivo for both prophylactic and therapeutic applications, involves contacting the receptor with a ligand which blocks or prevents stimulation of the receptor (an antagonist jigand).

[0191] In terms of prophylaxis, such a method has specific application to the Sertoli cells of the testis, which abundantly express the receptor. Due to the involvement of these Sertoli cells in male fertility, contacting the receptors with an antagonistic ligand has a potential application in the control of male fertility including in male contraception.

[0192] A potential therapeutic application of contacting cells expressing the Dtk of the invention with an antagonistic ligand is in anti-tumour therapy. This potential application arises from the growing understanding of the role sometimes played by RTKs in tumour formation.

[0193] G Therapeutic Applications of Soluable Receptors

[0194] The extracellular receptor domain of the invention as described above also have potential therapeutic applications. Such applications are in a method of treating a disease, syndrome or condition caused or mediated by an excess of a ligand of the invention (whether stimulant or antagonistic).

[0195] In this method, the extracellular receptor domain of the Dtk in a soluble form can be used as a molecular “sponge” or “sink” to remove the excess of the ligand or at least to block its activity.

[0196] H Functional Equivalents

[0197] The invention includes functional equivalents of the Dtks, receptor domains, nucleic acid molecules and ligands described above.

[0198] The Dtks, extracellular receptor domains and ligands are or include proteins. A protein is considered a functional equivalent of another protein for a specific function if the equivalent protein is immunologically cross-reactive with, and has the same function as, the original protein. The equivalent may, for example, be a fragment of the protein, or a substitution, addition or deletion mutant of the protein.

[0199] For example, it is possible to substitute amino acids in a sequence with equivalent amino acids using conventional techniques. Groups of amino acids known normally to be equivalent are:

[0200] (a) Ala(A) Ser(S) Thr(T) Pro(P) Gly(G);

[0201] (b) Asn(N) Asp(D) Glu(E) Gln(Q);

[0202] (c) His(H) Arg(R) Lys(K);

[0203] (d) Met(M) Leu(L) Ile(I) Val(V); and

[0204] (e) Phe(F) Tyr(Y) Trp(W).

[0205] Substitutions, additions and/or deletions in the receptors and ligands may be made as long as the resulting equivalent receptors and ligands are immunologically cross-reactive with, and have the same function as, the native receptors and ligands.

[0206] The equivalent receptors and ligands will normally have substantially the same amino acid sequence as the native receptors and ligands. An amino acid sequence that is substantially the same as another sequence, but that differs from the other sequence by means of one or more substitutions, additions and/or deletions is considered to be an equivalent sequence. Preferably, less than 25%, more preferably less than 10%, and most preferably less than 5% of the number of amino acid residues in the amino acid sequence of the native receptors and ligands are substituted for, added to, or deleted from.

[0207] Equivalent nucleic acid molecules include nucleic acid sequences that encode equivalent receptors and ligands as defined above. Equivalent nucleic acid molecules also include nucleic acid sequences that, due to the degeneracy of the nucleic acid code, differ from native nucleic acid sequences in ways that do not affect the corresponding amino acid sequences.

[0208] Those persons skilled in the art will of course appreciate that the above description is provided by way of example only and that the invention is limited only by the lawful scope of the appended claims.

1 16 874 AMINO ACIDS AMINO ACID LINEAR PROTEIN 1 Met Gly Trp Pro Gly Leu Arg Pro Leu Leu Leu Ala Gly 1 5 10 Leu Ala Ser Leu Leu Leu Pro Gly Ser Ala Ala Ala Gly 15 20 25 Leu Lys Leu Met Gly Ala Pro Val Lys Met Thr Val Ser 30 35 Gln Gly Gln Pro Val Lys Leu Asn Cys Ser Val Glu Gly 40 45 50 Met Glu Asp Pro Asp Ile His Trp Met Lys Asp Gly Thr 55 60 65 Val Val Gln Asn Ala Ser Gln Val Ser Ile Ser Ile Ser 70 75 Glu His Ser Trp Ile Gly Leu Leu Ser Leu Lys Ser Val 80 85 90 Glu Arg Ser Asp Ala Gly Leu Tyr Trp Cys Gln Val Lys 95 100 Asp Gly Glu Glu Thr Lys Ile Ser Gln Ser Val Trp Leu 105 110 115 Thr Val Glu Gly Val Pro Phe Phe Thr Val Glu Pro Lys 120 125 130 Asp Leu Ala Val Pro Pro Asn Ala Pro Phe Gln Leu Ser 135 140 Cys Glu Ala Val Gly Pro Pro Glu Pro Val Thr Ile Tyr 145 150 155 Trp Trp Arg Gly Leu Thr Lys Val Gly Gly Pro Ala Pro 160 165 Ser Pro Ser Val Leu Asn Val Thr Gly Val Thr Gln Arg 170 175 180 Thr Glu Phe Ser Cys Glu Ala Arg Asn Ile Lys Gly Leu 185 190 195 Ala Thr Ser Arg Pro Ala Ile Val Arg Leu Gln Ala Pro 200 205 Pro Ala Ala Pro Phe Asn Thr Thr Val Thr Thr Ile Ser 210 215 220 Ser Tyr Asn Ala Ser Val Ala Trp Val Pro Gly Ala Asp 225 230 Gly Leu Ala Leu Leu His Ser Cys Thr Val Gln Val Ala 235 240 245 His Ala Pro Gly Glu Trp Glu Ala Leu Ala Val Val Val 250 255 260 Pro Val Pro Pro Phe Thr Cys Leu Leu Arg Asn Leu Ala 265 270 Pro Ala Thr Asn Tyr Ser Leu Arg Val Arg Cys Ala Asn 275 280 285 Ala Leu Gly Pro Ser Pro Tyr Gly Asp Trp Val Pro Phe 290 295 Gln Thr Lys Gly Leu Ala Pro Ala Arg Ala Pro Gln Asn 300 305 310 Phe His Ala Ile Arg Thr Asp Ser Gly Leu Ile Leu Glu 315 320 325 Trp Glu Glu Val Ile Pro Glu Asp Pro Gly Glu Gly Pro 330 335 Leu Gly Pro Tyr Lys Leu Ser Trp Val Gln Glu Asn Gly 340 345 350 Thr Gln Asp Glu Leu Met Val Glu Gly Thr Arg Ala Asn 355 360 Leu Thr Asp Trp Asp Pro Gln Lys Asp Leu Ile Leu Arg 365 370 375 Val Cys Ala Ser Asn Ala Ile Gly Asp Gly Pro Trp Ser 380 385 390 Gln Pro Leu Val Val Ser Ser His Asp His Ala Gly Arg 395 400 Gln Gly Pro Pro His Ser Arg Thr Ser Trp Val Pro Val 405 410 415 Val Leu Gly Val Leu Thr Ala Leu Ile Thr Ala Ala Ala 420 425 Leu Ala Leu Ile Leu Leu Arg Lys Arg Arg Lys Glu Thr 430 435 440 Arg Phe Gly Gln Ala Phe Asp Ser Val Met Ala Arg Gly 445 450 455 Glu Pro Ala Val His Phe Arg Ala Ala Arg Ser Phe Asn 460 465 Arg Glu Arg Pro Glu Arg Ile Glu Ala Thr Leu Asp Ser 470 475 480 Leu Gly Ile Ser Asp Glu Leu Lys Glu Lys Leu Glu Asp 485 490 Val Leu Ile Pro Glu Gln Gln Phe Thr Leu Gly Arg Met 495 500 505 Leu Gly Lys Gly Glu Phe Gly Ser Val Arg Glu Ala Gln 510 515 520 Leu Lys Gln Glu Asp Gly Ser Phe Val Lys Val Ala Val 525 530 Lys Met Leu Lys Ala Asp Ile Ile Ala Ser Ser Asp Ile 535 540 545 Glu Glu Phe Leu Arg Glu Ala Ala Cys Met Lys Glu Phe 550 555 Asp His Pro His Val Ala Lys Leu Val Gly Val Ser Leu 560 565 570 Arg Ser Arg Ala Lys Gly Arg Leu Pro Ile Pro Met Val 575 580 585 Ile Leu Pro Phe Met Lys His Gly Asp Leu His Ala Phe 590 595 Leu Leu Ala Ser Arg Ile Gly Glu Asn Pro Phe Asn Leu 600 605 610 Pro Leu Gln Thr Leu Val Arg Phe Met Val Asp Ile Ala 615 620 Cys Gly Met Glu Tyr Leu Ser Ser Arg Asn Phe Ile His 625 630 635 Arg Asp Leu Ala Ala Arg Asn Cys Met Leu Ala Glu Asp 640 645 650 Met Thr Val Cys Val Ala Asp Phe Gly Leu Ser Arg Lys 655 660 Ile Tyr Ser Gly Asp Tyr Tyr Arg Gln Gly Cys Ala Ser 665 670 675 Lys Leu Pro Val Lys Trp Leu Ala Leu Glu Ser Leu Ala 680 685 Asp Asn Leu Tyr Thr Val His Ser Asp Val Trp Ala Phe 690 695 700 Gly Val Thr Met Trp Glu Ile Met Thr Arg Gly Gln Thr 705 710 715 Pro Tyr Ala Gly Ile Glu Asn Ala Glu Ile Tyr Asn Tyr 720 725 Leu Ile Gly Gly Asn Arg Leu Lys Gln Pro Pro Glu Cys 730 735 740 Met Glu Glu Val Tyr Asp Leu Met Tyr Gln Cys Trp Ser 745 750 Ala Asp Pro Lys Gln Arg Pro Ser Phe Thr Cys Leu Arg 755 760 765 Met Glu Leu Glu Asn Ile Leu Gly His Leu Ser Val Leu 770 775 780 Ser Thr Ser Gln Asp Pro Leu Tyr Ile Asn Ile Glu Arg 785 790 Ala Glu Gln Pro Thr Glu Ser Gly Ser Pro Glu Leu His 795 800 805 Cys Gly Glu Arg Ser Ser Ser Glu Ala Gly Asp Gly Ser 810 815 Gly Val Gly Ala Val Gly Gly Ile Pro Ser Asp Ser Arg 820 825 830 Tyr Ile Phe Ser Pro Gly Gly Leu Ser Glu Ser Pro Gly 835 840 845 Gln Leu Glu Gln Gln Pro Glu Ser Pro Leu Asn Glu Asn 850 855 Gln Arg Leu Leu Leu Leu Gln Gln Gly Leu Leu Pro His 860 865 870 Ser Ser Cys 850 AMINO ACIDS AMINO ACID LINEAR PROTEIN 2 Ala Gly 1 Leu Lys Leu Met Gly Ala Pro Val Lys Met Thr Val Ser 5 10 15 Gln Gly Gln Pro Val Lys Leu Asn Cys Ser Val Glu Gly 20 25 Met Glu Asp Pro Asp Ile His Trp Met Lys Asp Gly Thr 30 35 40 Val Val Gln Asn Ala Ser Gln Val Ser Ile Ser Ile Ser 45 50 Glu His Ser Trp Ile Gly Leu Leu Ser Leu Lys Ser Val 55 60 65 Glu Arg Ser Asp Ala Gly Leu Tyr Trp Cys Gln Val Lys 70 75 80 Asp Gly Glu Glu Thr Lys Ile Ser Gln Ser Val Trp Leu 85 90 Thr Val Glu Gly Val Pro Phe Phe Thr Val Glu Pro Lys 95 100 105 Asp Leu Ala Val Pro Pro Asn Ala Pro Phe Gln Leu Ser 110 115 Cys Glu Ala Val Gly Pro Pro Glu Pro Val Thr Ile Tyr 120 125 130 Trp Trp Arg Gly Leu Thr Lys Val Gly Gly Pro Ala Pro 135 140 145 Ser Pro Ser Val Leu Asn Val Thr Gly Val Thr Gln Arg 150 155 Thr Glu Phe Ser Cys Glu Ala Arg Asn Ile Lys Gly Leu 160 165 170 Ala Thr Ser Arg Pro Ala Ile Val Arg Leu Gln Ala Pro 175 180 Pro Ala Ala Pro Phe Asn Thr Thr Val Thr Thr Ile Ser 185 190 195 Ser Tyr Asn Ala Ser Val Ala Trp Val Pro Gly Ala Asp 200 205 210 Gly Leu Ala Leu Leu His Ser Cys Thr Val Gln Val Ala 215 220 His Ala Pro Gly Glu Trp Glu Ala Leu Ala Val Val Val 225 230 235 Pro Val Pro Pro Phe Thr Cys Leu Leu Arg Asn Leu Ala 240 245 Pro Ala Thr Asn Tyr Ser Leu Arg Val Arg Cys Ala Asn 250 255 260 Ala Leu Gly Pro Ser Pro Tyr Gly Asp Trp Val Pro Phe 265 270 275 Gln Thr Lys Gly Leu Ala Pro Ala Arg Ala Pro Gln Asn 280 285 Phe His Ala Ile Arg Thr Asp Ser Gly Leu Ile Leu Glu 290 295 300 Trp Glu Glu Val Ile Pro Glu Asp Pro Gly Glu Gly Pro 305 310 Leu Gly Pro Tyr Lys Leu Ser Trp Val Gln Glu Asn Gly 315 320 325 Thr Gln Asp Glu Leu Met Val Glu Gly Thr Arg Ala Asn 330 335 340 Leu Thr Asp Trp Asp Pro Gln Lys Asp Leu Ile Leu Arg 345 350 Val Cys Ala Ser Asn Ala Ile Gly Asp Gly Pro Trp Ser 355 360 365 Gln Pro Leu Val Val Ser Ser His Asp His Ala Gly Arg 370 375 Gln Gly Pro Pro His Ser Arg Thr Ser Trp Val Pro Val 380 385 390 Val Leu Gly Val Leu Thr Ala Leu Ile Thr Ala Ala Ala 395 400 405 Leu Ala Leu Ile Leu Leu Arg Lys Arg Arg Lys Glu Thr 410 415 Arg Phe Gly Gln Ala Phe Asp Ser Val Met Ala Arg Gly 420 425 430 Glu Pro Ala Val His Phe Arg Ala Ala Arg Ser Phe Asn 435 440 Arg Glu Arg Pro Glu Arg Ile Glu Ala Thr Leu Asp Ser 445 450 455 Leu Gly Ile Ser Asp Glu Leu Lys Glu Lys Leu Glu Asp 460 465 470 Val Leu Ile Pro Glu Gln Gln Phe Thr Leu Gly Arg Met 475 480 Leu Gly Lys Gly Glu Phe Gly Ser Val Arg Glu Ala Gln 485 490 495 Leu Lys Gln Glu Asp Gly Ser Phe Val Lys Val Ala Val 500 505 Lys Met Leu Lys Ala Asp Ile Ile Ala Ser Ser Asp Ile 510 515 520 Glu Glu Phe Leu Arg Glu Ala Ala Cys Met Lys Glu Phe 525 530 535 Asp His Pro His Val Ala Lys Leu Val Gly Val Ser Leu 540 545 Arg Ser Arg Ala Lys Gly Arg Leu Pro Ile Pro Met Val 550 555 560 Ile Leu Pro Phe Met Lys His Gly Asp Leu His Ala Phe 565 570 Leu Leu Ala Ser Arg Ile Gly Glu Asn Pro Phe Asn Leu 575 580 585 Pro Leu Gln Thr Leu Val Arg Phe Met Val Asp Ile Ala 590 595 600 Cys Gly Met Glu Tyr Leu Ser Ser Arg Asn Phe Ile His 605 610 Arg Asp Leu Ala Ala Arg Asn Cys Met Leu Ala Glu Asp 615 620 625 Met Thr Val Cys Val Ala Asp Phe Gly Leu Ser Arg Lys 630 635 Ile Tyr Ser Gly Asp Tyr Tyr Arg Gln Gly Cys Ala Ser 640 645 650 Lys Leu Pro Val Lys Trp Leu Ala Leu Glu Ser Leu Ala 655 660 665 Asp Asn Leu Tyr Thr Val His Ser Asp Val Trp Ala Phe 670 675 Gly Val Thr Met Trp Glu Ile Met Thr Arg Gly Gln Thr 680 685 690 Pro Tyr Ala Gly Ile Glu Asn Ala Glu Ile Tyr Asn Tyr 695 700 Leu Ile Gly Gly Asn Arg Leu Lys Gln Pro Pro Glu Cys 705 710 715 Met Glu Glu Val Tyr Asp Leu Met Tyr Gln Cys Trp Ser 720 725 730 Ala Asp Pro Lys Gln Arg Pro Ser Phe Thr Cys Leu Arg 735 740 Met Glu Leu Glu Asn Ile Leu Gly His Leu Ser Val Leu 745 750 755 Ser Thr Ser Gln Asp Pro Leu Tyr Ile Asn Ile Glu Arg 760 765 Ala Glu Gln Pro Thr Glu Ser Gly Ser Pro Glu Leu His 770 775 780 Cys Gly Glu Arg Ser Ser Ser Glu Ala Gly Asp Gly Ser 785 790 795 Gly Val Gly Ala Val Gly Gly Ile Pro Ser Asp Ser Arg 800 805 Tyr Ile Phe Ser Pro Gly Gly Leu Ser Glu Ser Pro Gly 810 815 820 Gln Leu Glu Gln Gln Pro Glu Ser Pro Leu Asn Glu Asn 825 830 Gln Arg Leu Leu Leu Leu Gln Gln Gly Leu Leu Pro His 835 840 845 Ser Ser Cys 850 876 AMINO ACIDS AMINO ACID LINEAR PROTEIN 3 Met Gly Arg Pro Gly Leu Pro Pro Leu Pro Leu Pro Pro 1 5 10 Pro Pro Arg Leu Gly Leu Leu Leu Ala Glu Ser Ala Ala 15 20 25 Ala Gly Leu Lys Leu Met Gly Ala Pro Val Lys Leu Thr 30 35 Val Ser Gln Gly Gln Pro Val Lys Leu Asn Cys Ser Val 40 45 50 Glu Gly Met Glu Glu Pro Asp Ile Gln Trp Val Lys Asp 55 60 65 Gly Ala Val Val Gln Asn Leu Asp Gln Leu Tyr Ile Pro 70 75 Val Ser Glu Gln His Trp Ile Gly Phe Leu Ser Leu Lys 80 85 90 Ser Val Glu Arg Ser Asp Ala Gly Arg Tyr Trp Cys Gln 95 100 Val Glu Asp Gly Gly Glu Thr Glu Ile Ser Gln Pro Val 105 110 115 Trp Leu Thr Val Glu Gly Val Pro Phe Phe Thr Val Glu 120 125 130 Pro Lys Asp Leu Ala Val Pro Pro Asn Ala Pro Phe Gln 135 140 Leu Ser Cys Glu Ala Val Gly Pro Pro Glu Pro Val Thr 145 150 155 Ile Val Trp Trp Arg Gly Thr Thr Lys Ile Gly Gly Pro 160 165 Ala Pro Ser Pro Ser Val Leu Asn Val Thr Gly Val Thr 170 175 180 Gln Ser Thr Met Phe Ser Cys Glu Ala His Asn Leu Lys 185 190 195 Gly Leu Ala Ser Ser Arg Thr Ala Thr Val His Leu Gln 200 205 Ala Leu Pro Ala Ala Pro Phe Asn Ile Thr Val Thr Lys 210 215 220 Leu Ser Ser Ser Asn Ala Ser Val Ala Trp Met Pro Gly 225 230 Ala Asp Gly Arg Ala Leu Leu Gln Ser Cys Thr Val Gln 235 240 245 Val Thr Gln Ala Pro Gly Gly Trp Glu Val Leu Ala Val 250 255 260 Val Val Pro Val Pro Pro Phe Thr Cys Leu Leu Arg Asp 265 270 Leu Val Pro Ala Thr Asn Tyr Ser Leu Arg Val Arg Cys 275 280 285 Ala Asn Ala Leu Gly Pro Ser Pro Tyr Ala Asp Trp Val 290 295 Pro Phe Gln Thr Lys Gly Leu Ala Pro Ala Ser Ala Pro 300 305 310 Gln Asn Leu His Ala Ile Arg Thr Asp Ser Gly Leu Ile 315 320 325 Leu Glu Trp Glu Glu Val Ile Pro Glu Ala Pro Leu Glu 330 335 Gly Pro Leu Gly Pro Tyr Lys Leu Ser Trp Val Gln Asp 340 345 350 Asn Gly Thr Gln Asp Glu Leu Thr Val Glu Gly Thr Arg 355 360 Ala Asn Leu Thr Gly Trp Asp Pro Gln Lys Asp Leu Ile 365 370 375 Val Arg Val Cys Val Ser Asn Ala Val Gly Cys Gly Pro 380 385 390 Trp Ser Gln Pro Leu Val Val Ser Ser His Asp Arg Ala 395 400 Gly Gln Gln Gly Pro Pro His Ser Arg Thr Ser Trp Val 405 410 415 Pro Val Val Leu Gly Val Leu Thr Ala Leu Val Thr Ala 420 425 Ala Ala Leu Ala Leu Ile Leu Leu Arg Lys Arg Arg Lys 430 435 440 Glu Thr Arg Phe Gly Gln Ala Phe Asp Ser Val Met Ala 445 450 455 Arg Gly Glu Pro Ala Val His Phe Arg Ala Ala Arg Ser 460 465 Phe Asn Arg Glu Arg Pro Glu Arg Ile Glu Ala Thr Leu 470 475 480 Asp Ser Leu Gly Ile Ser Asp Glu Leu Lys Glu Lys Leu 485 490 Glu Asp Val Leu Ile Pro Glu Gln Gln Phe Thr Leu Gly 495 500 505 Arg Met Leu Gly Lys Gly Glu Phe Gly Ser Val Arg Glu 510 515 520 Ala Gln Leu Lys Gln Glu Asp Gly Ser Phe Val Lys Val 525 530 Ala Val Lys Met Leu Lys Ala Asp Ile Ile Ala Ser Ser 535 540 545 Asp Ile Glu Glu Phe Leu Arg Glu Ala Ala Cys Met Lys 550 555 Glu Phe Asp His Pro His Val Ala Lys Leu Val Gly Val 560 565 570 Ser Leu Arg Ser Arg Ala Lys Gly Arg Leu Pro Ile Pro 575 580 585 Met Val Ile Leu Pro Phe Met Lys His Gly Asp Leu His 590 595 Ala Phe Leu Leu Ala Ser Arg Ile Gly Glu Asn Pro Phe 600 605 610 Asn Leu Pro Leu Gln Thr Leu Ile Arg Phe Met Val Asp 615 620 Ile Ala Cys Gly Met Glu Tyr Leu Ser Ser Arg Asn Phe 625 630 635 Ile His Arg Asp Leu Ala Ala Arg Asn Cys Met Leu Ala 640 645 650 Glu Asp Met Thr Val Cys Val Ala Asp Phe Gly Leu Ser 655 660 Arg Lys Ile Tyr Ser Gly Asp Tyr Tyr Arg Gln Gly Cys 665 670 675 Ala Ser Lys Leu Pro Val Lys Trp Leu Ala Leu Glu Ser 680 685 Leu Ala Asp Asn Leu Tyr Thr Val Gln Ser Asp Val Trp 690 695 700 Ala Phe Gly Val Thr Met Trp Glu Ile Met Thr Arg Gly 705 710 715 Gln Thr Pro Tyr Ala Gly Ile Glu Asn Ala Glu Ile Tyr 720 725 Asn Tyr Leu Ile Gly Gly Asn Arg Leu Lys Gln Pro Pro 730 735 740 Glu Cys Met Glu Asp Val Tyr Asp Leu Met Tyr Gln Cys 745 750 Trp Ser Ala Asp Pro Lys Gln Arg Pro Ser Phe Thr Cys 755 760 765 Leu Arg Met Glu Leu Glu Asn Ile Leu Gly Gln Leu Ser 770 775 780 Val Leu Ser Ala Ser Gln Asp Pro Leu Tyr Ile Asn Ile 785 790 Glu Arg Ala Glu Glu Pro Thr Val Gly Gly Ser Leu Glu 795 800 805 Leu Pro Gly Arg Asp Gln Pro Tyr Ser Gly Ala Gly Asp 810 815 Gly Ser Gly Met Gly Ala Val Gly Gly Thr Pro Ser Asp 820 825 830 Cys Arg Tyr Ile Leu Thr Pro Gly Gly Leu Ala Glu Gln 835 840 845 Pro Gly Gln Ala Glu His Gln Pro Glu Ser Pro Leu Asn 850 855 Glu Thr Gln Arg Leu Leu Leu Leu Gln Gln Gly Leu Leu 860 865 870 Pro His Ser Ser Cys 875 850 AMINO ACIDS AMINO ACID LINEAR PROTEIN 4 Ala Gly Leu Lys Leu Met Gly Ala Pro Val Lys Leu Thr 1 5 10 Val Ser Gln Gly Gln Pro Val Lys Leu Asn Cys Ser Val 15 20 25 Glu Gly Met Glu Glu Pro Asp Ile Gln Trp Val Lys Asp 30 35 Gly Ala Val Val Gln Asn Leu Asp Gln Leu Tyr Ile Pro 40 45 50 Val Ser Glu Gln His Trp Ile Gly Phe Leu Ser Leu Lys 55 60 65 Ser Val Glu Arg Ser Asp Ala Gly Arg Tyr Trp Cys Gln 70 75 Val Glu Asp Gly Gly Glu Thr Glu Ile Ser Gln Pro Val 80 85 90 Trp Leu Thr Val Glu Gly Val Pro Phe Phe Thr Val Glu 95 100 Pro Lys Asp Leu Ala Val Pro Pro Asn Ala Pro Phe Gln 105 110 115 Leu Ser Cys Glu Ala Val Gly Pro Pro Glu Pro Val Thr 120 125 130 Ile Val Trp Trp Arg Gly Thr Thr Lys Ile Gly Gly Pro 135 140 Ala Pro Ser Pro Ser Val Leu Asn Val Thr Gly Val Thr 145 150 155 Gln Ser Thr Met Phe Ser Cys Glu Ala His Asn Leu Lys 160 165 Gly Leu Ala Ser Ser Arg Thr Ala Thr Val His Leu Gln 170 175 180 Ala Leu Pro Ala Ala Pro Phe Asn Ile Thr Val Thr Lys 185 190 195 Leu Ser Ser Ser Asn Ala Ser Val Ala Trp Met Pro Gly 200 205 Ala Asp Gly Arg Ala Leu Leu Gln Ser Cys Thr Val Gln 210 215 220 Val Thr Gln Ala Pro Gly Gly Trp Glu Val Leu Ala Val 225 230 Val Val Pro Val Pro Pro Phe Thr Cys Leu Leu Arg Asp 235 240 245 Leu Val Pro Ala Thr Asn Tyr Ser Leu Arg Val Arg Cys 250 255 260 Ala Asn Ala Leu Gly Pro Ser Pro Tyr Ala Asp Trp Val 265 270 Pro Phe Gln Thr Lys Gly Leu Ala Pro Ala Ser Ala Pro 275 280 285 Gln Asn Leu His Ala Ile Arg Thr Asp Ser Gly Leu Ile 290 295 Leu Glu Trp Glu Glu Val Ile Pro Glu Ala Pro Leu Glu 300 305 310 Gly Pro Leu Gly Pro Tyr Lys Leu Ser Trp Val Gln Asp 315 320 325 Asn Gly Thr Gln Asp Glu Leu Thr Val Glu Gly Thr Arg 330 335 Ala Asn Leu Thr Gly Trp Asp Pro Gln Lys Asp Leu Ile 340 345 350 Val Arg Val Cys Val Ser Asn Ala Val Gly Cys Gly Pro 355 360 Trp Ser Gln Pro Leu Val Val Ser Ser His Asp Arg Ala 365 370 375 Gly Gln Gln Gly Pro Pro His Ser Arg Thr Ser Trp Val 380 385 390 Pro Val Val Leu Gly Val Leu Thr Ala Leu Val Thr Ala 395 400 Ala Ala Leu Ala Leu Ile Leu Leu Arg Lys Arg Arg Lys 405 410 415 Glu Thr Arg Phe Gly Gln Ala Phe Asp Ser Val Met Ala 420 425 Arg Gly Glu Pro Ala Val His Phe Arg Ala Ala Arg Ser 430 435 440 Phe Asn Arg Glu Arg Pro Glu Arg Ile Glu Ala Thr Leu 445 450 455 Asp Ser Leu Gly Ile Ser Asp Glu Leu Lys Glu Lys Leu 460 465 Glu Asp Val Leu Ile Pro Glu Gln Gln Phe Thr Leu Gly 470 475 480 Arg Met Leu Gly Lys Gly Glu Phe Gly Ser Val Arg Glu 485 490 Ala Gln Leu Lys Gln Glu Asp Gly Ser Phe Val Lys Val 495 500 505 Ala Val Lys Met Leu Lys Ala Asp Ile Ile Ala Ser Ser 510 515 520 Asp Ile Glu Glu Phe Leu Arg Glu Ala Ala Cys Met Lys 525 530 Glu Phe Asp His Pro His Val Ala Lys Leu Val Gly Val 535 540 545 Ser Leu Arg Ser Arg Ala Lys Gly Arg Leu Pro Ile Pro 550 555 Met Val Ile Leu Pro Phe Met Lys His Gly Asp Leu His 560 565 570 Ala Phe Leu Leu Ala Ser Arg Ile Gly Glu Asn Pro Phe 575 580 585 Asn Leu Pro Leu Gln Thr Leu Ile Arg Phe Met Val Asp 590 595 Ile Ala Cys Gly Met Glu Tyr Leu Ser Ser Arg Asn Phe 600 605 610 Ile His Arg Asp Leu Ala Ala Arg Asn Cys Met Leu Ala 615 620 Glu Asp Met Thr Val Cys Val Ala Asp Phe Gly Leu Ser 625 630 635 Arg Lys Ile Tyr Ser Gly Asp Tyr Tyr Arg Gln Gly Cys 640 645 650 Ala Ser Lys Leu Pro Val Lys Trp Leu Ala Leu Glu Ser 655 660 Leu Ala Asp Asn Leu Tyr Thr Val Gln Ser Asp Val Trp 665 670 675 Ala Phe Gly Val Thr Met Trp Glu Ile Met Thr Arg Gly 680 685 Gln Thr Pro Tyr Ala Gly Ile Glu Asn Ala Glu Ile Tyr 690 695 700 Asn Tyr Leu Ile Gly Gly Asn Arg Leu Lys Gln Pro Pro 705 710 715 Glu Cys Met Glu Asp Val Tyr Asp Leu Met Tyr Gln Cys 720 725 Trp Ser Ala Asp Pro Lys Gln Arg Pro Ser Phe Thr Cys 730 735 740 Leu Arg Met Glu Leu Glu Asn Ile Leu Gly Gln Leu Ser 745 750 Val Leu Ser Ala Ser Gln Asp Pro Leu Tyr Ile Asn Ile 755 760 765 Glu Arg Ala Glu Glu Pro Thr Val Gly Gly Ser Leu Glu 770 775 780 Leu Pro Gly Arg Asp Gln Pro Tyr Ser Gly Ala Gly Asp 785 790 Gly Ser Gly Met Gly Ala Val Gly Gly Thr Pro Ser Asp 795 800 805 Cys Arg Tyr Ile Leu Thr Pro Gly Gly Leu Ala Glu Gln 810 815 Pro Gly Gln Ala Glu His Gln Pro Glu Ser Pro Leu Asn 820 825 830 Glu Thr Gln Arg Leu Leu Leu Leu Gln Gln Gly Leu Leu 835 840 845 Pro His Ser Ser Cys 850 386 AMINO ACIDS AMINO ACID LINEAR PROTEIN 5 Ala Gly 1 Leu Lys Leu Met Gly Ala Pro Val Lys Met Thr Val Ser 5 10 15 Gln Gly Gln Pro Val Lys Leu Asn Cys Ser Val Glu Gly 20 25 Met Glu Asp Pro Asp Ile His Trp Met Lys Asp Gly Thr 30 35 40 Val Val Gln Asn Ala Ser Gln Val Ser Ile Ser Ile Ser 45 50 Glu His Ser Trp Ile Gly Leu Leu Ser Leu Lys Ser Val 55 60 65 Glu Arg Ser Asp Ala Gly Leu Tyr Trp Cys Gln Val Lys 70 75 80 Asp Gly Glu Glu Thr Lys Ile Ser Gln Ser Val Trp Leu 85 90 Thr Val Glu Gly Val Pro Phe Phe Thr Val Glu Pro Lys 95 100 105 Asp Leu Ala Val Pro Pro Asn Ala Pro Phe Gln Leu Ser 110 115 Cys Glu Ala Val Gly Pro Pro Glu Pro Val Thr Ile Tyr 120 125 130 Trp Trp Arg Gly Leu Thr Lys Val Gly Gly Pro Ala Pro 135 140 145 Ser Pro Ser Val Leu Asn Val Thr Gly Val Thr Gln Arg 150 155 Thr Glu Phe Ser Cys Glu Ala Arg Asn Ile Lys Gly Leu 160 165 170 Ala Thr Ser Arg Pro Ala Ile Val Arg Leu Gln Ala Pro 175 180 Pro Ala Ala Pro Phe Asn Thr Thr Val Thr Thr Ile Ser 185 190 195 Ser Tyr Asn Ala Ser Val Ala Trp Val Pro Gly Ala Asp 200 205 210 Gly Leu Ala Leu Leu His Ser Cys Thr Val Gln Val Ala 215 220 His Ala Pro Gly Glu Trp Glu Ala Leu Ala Val Val Val 225 230 235 Pro Val Pro Pro Phe Thr Cys Leu Leu Arg Asn Leu Ala 240 245 Pro Ala Thr Asn Tyr Ser Leu Arg Val Arg Cys Ala Asn 250 255 260 Ala Leu Gly Pro Ser Pro Tyr Gly Asp Trp Val Pro Phe 265 270 275 Gln Thr Lys Gly Leu Ala Pro Ala Arg Ala Pro Gln Asn 280 285 Phe His Ala Ile Arg Thr Asp Ser Gly Leu Ile Leu Glu 290 295 300 Trp Glu Glu Val Ile Pro Glu Asp Pro Gly Glu Gly Pro 305 310 Leu Gly Pro Tyr Lys Leu Ser Trp Val Gln Glu Asn Gly 315 320 325 Thr Gln Asp Glu Leu Met Val Glu Gly Thr Arg Ala Asn 330 335 340 Leu Thr Asp Trp Asp Pro Gln Lys Asp Leu Ile Leu Arg 345 350 Val Cys Ala Ser Asn Ala Ile Gly Asp Gly Pro Trp Ser 355 360 365 Gln Pro Leu Val Val Ser Ser His Asp His Ala Gly Arg 370 375 Gln Gly Pro Pro His Ser Arg 380 385 386 AMINO ACIDS AMINO ACID LINEAR PROTEIN 6 Ala Gly Leu Lys Leu Met Gly Ala Pro Val Lys Leu Thr 1 5 10 Val Ser Gln Gly Gln Pro Val Lys Leu Asn Cys Ser Val 15 20 25 Glu Gly Met Glu Glu Pro Asp Ile Gln Trp Val Lys Asp 30 35 Gly Ala Val Val Gln Asn Leu Asp Gln Leu Tyr Ile Pro 40 45 50 Val Ser Glu Gln His Trp Ile Gly Phe Leu Ser Leu Lys 55 60 65 Ser Val Glu Arg Ser Asp Ala Gly Arg Tyr Trp Cys Gln 70 75 Val Glu Asp Gly Gly Glu Thr Glu Ile Ser Gln Pro Val 80 85 90 Trp Leu Thr Val Glu Gly Val Pro Phe Phe Thr Val Glu 95 100 Pro Lys Asp Leu Ala Val Pro Pro Asn Ala Pro Phe Gln 105 110 115 Leu Ser Cys Glu Ala Val Gly Pro Pro Glu Pro Val Thr 120 125 130 Ile Val Trp Trp Arg Gly Thr Thr Lys Ile Gly Gly Pro 135 140 Ala Pro Ser Pro Ser Val Leu Asn Val Thr Gly Val Thr 145 150 155 Gln Ser Thr Met Phe Ser Cys Glu Ala His Asn Leu Lys 160 165 Gly Leu Ala Ser Ser Arg Thr Ala Thr Val His Leu Gln 170 175 180 Ala Leu Pro Ala Ala Pro Phe Asn Ile Thr Val Thr Lys 185 190 195 Leu Ser Ser Ser Asn Ala Ser Val Ala Trp Met Pro Gly 200 205 Ala Asp Gly Arg Ala Leu Leu Gln Ser Cys Thr Val Gln 210 215 220 Val Thr Gln Ala Pro Gly Gly Trp Glu Val Leu Ala Val 225 230 Val Val Pro Val Pro Pro Phe Thr Cys Leu Leu Arg Asp 235 240 245 Leu Val Pro Ala Thr Asn Tyr Ser Leu Arg Val Arg Cys 250 255 260 Ala Asn Ala Leu Gly Pro Ser Pro Tyr Ala Asp Trp Val 265 270 Pro Phe Gln Thr Lys Gly Leu Ala Pro Ala Ser Ala Pro 275 280 285 Gln Asn Leu His Ala Ile Arg Thr Asp Ser Gly Leu Ile 290 295 Leu Glu Trp Glu Glu Val Ile Pro Glu Ala Pro Leu Glu 300 305 310 Gly Pro Leu Gly Pro Tyr Lys Leu Ser Trp Val Gln Asp 315 320 325 Asn Gly Thr Gln Asp Glu Leu Thr Val Glu Gly Thr Arg 330 335 Ala Asn Leu Thr Gly Trp Asp Pro Gln Lys Asp Leu Ile 340 345 350 Val Arg Val Cys Val Ser Asn Ala Val Gly Cys Gly Pro 355 360 Trp Ser Gln Pro Leu Val Val Ser Ser His Asp Arg Ala 365 370 375 Gly Gln Gln Gly Pro Pro His Ser Arg 380 385 3919 BASE PAIRS NUCLEIC ACID SINGLE LINEAR cDNA 7 GGCACGAGTG TGGAAGGAGC GCGGTGGCCC AGCCGCAGCC CCGGGGACTC CTCGCTGCTG 60 ACGGCGGTGG CCGCGGCTCT AGGCGGCCGC GGGTCCCGGA CGCCCCGGCC GAGCGCCGCC 120 CCCCGCCCCT CCCGCGGGCC TCCCGCCCCT CCTCCGCCAC CCTCCTCTCA GCGCTCGCGG 180 GCCGGGCCCG GCATGGTGCG GCGTCGCCGC CGATGGCGCT GAGGCGGAGC ATGGGGTGGC 240 CGGGGCTCCG GCCGCTGCTG CTGGCGGGAC TGGCTTCTCT GCTGCTCCCC GGGTCTGCGG 300 CCGCAGGCCT GAAGCTCATG GGCGCCCCAG TGAAGATGAC CGTGTCTCAG GGGCAGCCAG 360 TGAAGCTCAA CTGCAGCGTG GAGGGGATGG AGGACCCTGA CATCCACTGG ATGAAGGATG 420 GCACCGTGGT CCAGAATGCA AGCCAGGTGT CCATCTCCAT CAGCGAGCAC AGCTGGATTG 480 GCTTACTCAG CCTAAAGTCA GTGGAGCGGT CTGATGCTGG CCTGTACTGG TGCCAGGTGA 540 AGGATGGGGA GGAAACCAAG ATCTCTCAGT CAGTATGGCT CACTGTCGAA GGTGTGCCAT 600 TCTTCACAGT GGAACCAAAA GATCTGGCGG TGCCACCCAA TGCCCCTTTT CAGCTGTCTT 660 GTGAGGCTGT GGGTCCTCCA GAACCCGTAA CCATTTACTG GTGGAGAGGA CTCACTAAAG 720 TTGGGGGACC TGCTCCCTCT CCCTCTGTTT TAAATGTGAC AGGAGTGACC CAGCGCACAG 780 AGTTTTCTTG TGAAGCCCGC AACATAAAAG GCCTGGCCAC TTCCCGACCA GCCATTGTTC 840 GCCTTCAAGC ACCGCCTGCA GCTCCTTTCA ACACCACAGT AACAACGATC TCCAGCTACA 900 ACGCTAGCGT GGCCTGGGTG CCAGGTGCTG ACGGCCTAGC TCTGCTGCAT TCCTGTACTG 960 TACAGGTGGC ACACGCCCCA GGAGAATGGG AGGCCCTTGC TGTTGTGGTT CCTGTGCCAC 1020 CTTTTACCTG CCTGCTTCGG AACTTGGCCC CTGCCACCAA CTACAGCCTT AGGGTGCGCT 1080 GTGCCAATGC CTTGGGCCCT TCTCCCTACG GCGACTGGGT GCCCTTTCAG ACAAAGGGCC 1140 TAGCGCCAGC CAGAGCTCCT CAGAATTTCC ATGCCATTCG TACCGACTCA GGCCTTATCC 1200 TGGAATGGGA AGAAGTGATT CCTGAGGACC CTGGGGAAGG CCCCCTAGGA CCTTATAAGC 1260 TGTCCTGGGT CCAAGAAAAT GGAACCCAGG ATGAGCTGAT GGTGGAAGGG ACCAGGGCCA 1320 ATCTGACCGA CTGGGATCCC CAGAAGGACC TGATTTTGCG TGTGTGTGCC TCCAATGCAA 1380 TTGGTGATGG GCCCTGGAGT CAGCCACTGG TGGTGTCTTC TCATGACCAT GCAGGGAGGC 1440 AGGGCCCTCC CCACAGCCGC ACATCCTGGG TGCCTGTGGT CCTGGGCGTG CTCACCGCCC 1500 TGATCACAGC TGCTGCCTTG GCCCTCATCC TGCTTCGGAA GAGACGCAAG GAGACGCGTT 1560 TCGGGCAAGC CTTTGACAGT GTCATGGCCC GAGGGGAGCC AGCTGTACAC TTCCGGGCAG 1620 CCCGATCTTT CAATCGAGAA AGGCCTGAAC GCATTGAGGC CACATTGGAT AGCCTGGGCA 1680 TCAGCGATGA ATTGAAGGAA AAGCTGGAGG ATGTCCTCAT TCCAGAGCAG CAGTTCACCC 1740 TCGGTCGGAT GTTGGGCAAA GGAGAGTTTG GATCAGTGCG GGAAGCCCAG CTAAAGCAGG 1800 AAGATGGCTC CTTCGTGAAA GTGGCAGTGA AGATGCTGAA AGCTGACATC ATTGCCTCAA 1860 GCGACATAGA AGAGTTCCTC CGGGAAGCAG CTTGCATGAA GGAGTTTGAC CATCCACACG 1920 TGGCCAAGCT TGTTGGGGTG AGCCTCCGGA GCAGGGCTAA AGGTCGTCTC CCCATTCCCA 1980 TGGTCATCCT GCCCTTCATG AAACATGGAG ACTTGCACGC CTTTCTGCTC GCCTCCCGAA 2040 TCGGGGAGAA CCCTTTTAAC CTGCCCCTGC AGACCCTGGT CCGGTTCATG GTGGACATTG 2100 CCTGTGGCAT GGAGTACCTG AGCTCCCGGA ACTTCATCCA CCGAGACCTA GCAGCTCGGA 2160 ATTGCATGCT GGCCGAGGAC ATGACAGTGT GTGTGGCTGA TTTTGGACTC TCTCGGAAAA 2220 TCTATAGCGG GGACTATTAT CGTCAGGGCT GTGCCTCCAA ATTGCCCGTC AAGTGGCTGG 2280 CCCTGGAGAG CTTGGCTGAC AACTTGTATA CTGTACACAG TGATGTGTGG GCCTTCGGGG 2340 TGACCATGTG GGAGATCATG ACTCGTGGGC AGACGCCATA TGCTGGCATT GAAAATGCTG 2400 AGATTTACAA CTACCTCATC GGCGGGAACC GCCTGAAGCA GCCTCCGGAG TGCATGGAGG 2460 AAGTGTATGA TCTCATGTAC CAGTGCTGGA GCGCCGACCC CAAGCAGCGC CCAAGCTTCA 2520 CGTGTCTGCG AATGGAACTG GAGAACATTC TGGGCCACCT GTCTGTGCTG TCCACCAGCC 2580 AGGACCCCTT GTACATCAAC ATTGAGAGAG CTGAGCAGCC TACTGAGAGT GGCAGCCCTG 2640 AGCTGCACTG TGGAGAGCGA TCCAGCAGCG AGGCAGGGGA CGGCAGTGGC GTGGGGGCAG 2700 TAGGTGGCAT CCCCAGTGAC TCTCGGTACA TCTTCAGCCC CGGAGGGCTA TCCGAGTCAC 2760 CAGGGCAGCT GGAGCAGCAG CCAGAAAGCC CCCTCAATGA GAACCAGAGG CTGTTGTTGC 2820 TGCAGCAAGG GCTACTGCCT CACAGTAGCT GTTAACCCTC AGGCAGAGGA AAGTTGGGGC 2880 CCCTGGCTCT GCTGACCACT GTGCTGCCTG ACTAGGCCCA GTCTGATCAC AGCCCAGGCA 2940 GCAAGGTATG GAGGCTCCTG TGGTAGCCCT CCCAAGCTGT GCTGGCGCCT GGACGGACCA 3000 AATTGCCCAA TCCCAGTTCT TCCTGCAGCC GCTCTGGCCA GCCTGGCATC AGTTCAGGCC 3060 TTGGCTTAGA GGAGGTGAGC CAGAGCTGGT TGCCTGAATG CAGGCAGCTG GCAGGAGGGG 3120 AGGGTGGCTA TGTTTCCATG GGTACCATGG GTGTGGATGG CAGTAAGGGA GGGTAGCAAC 3180 AGCCCTGTGG GCCCCTACCC TCCTGGCTGA GCTGCTCCTA CTTTAGTGCA TGCTTGGAGC 3240 CGCCTGCAGC CTGGAACTCA GCACTGCCCA CCACACTTGG GCCGAAATGC CAGGTTTGCC 3300 CCTCTTAAGT CACAAAGAGA TGTCCATGTA TTGTTCCCTT TTAGGTGATG ATTAGGAAGG 3360 GATTGGCACA CTTGGGTCCC TAAGCCCTAT GGCAGGAAAT GGTGGGATAT TCTCAGGTCT 3420 GAATCCTCAT CATCTTCCTG ATTCCCCACC CTGCAAAGGC CTGGAACTGG CTGTGGGGCT 3480 CTGAGGCATG CTGAAGGACA AAAGATTACA GAGATCCGAC TTCAAAAGGC AGGGTCTGAG 3540 TCTGGCAGGT GGAGAGGTGC TAAGGGGCTG GCCCAGGAGT CAGGCATTTC AGGACCCCTC 3600 CAAGCTTCTA CAGTCTGTCT GAGCATGCTA CCAAGCCCCC AGATACCCCA AAACTAACAG 3660 AGGCAGTTTT GTCTGAGCCC AGCCCTCCCA CATGATGACC CTTAGGTCTA CCCTCCTCTC 3720 TAAATGGACA TCCTCGTTTG TCCCAAGTCT CCAGAGAGAC TACTGATGGC TGATGTGGGT 3780 AAGAAAAGTT CCAGGAACCA GGGCTGGGGT GGAACCAGGG CTGGGGTCGA GGCAGGCTCT 3840 TGGGCAGGCT CTTGCTGTTA GGAACATTTC TAAGCTATTA AGTTGCTGTT TCAAAACAAA 3900 TAAAATTGAA ACATAAAGA 3919 2550 BASE PAIRS NUCLEIC ACID SINGLE LINEAR cDNA 8 GCAGGCCTGA AGCTCATGGG CGCCCCAGTG AAGATGACCG TGTCTCAGGG GCAGCCAGTG 60 AAGCTCAACT GCAGCGTGGA GGGGATGGAG GACCCTGACA TCCACTGGAT GAAGGATGGC 120 ACCGTGGTCC AGAATGCAAG CCAGGTGTCC ATCTCCATCA GCGAGCACAG CTGGATTGGC 180 TTACTCAGCC TAAAGTCAGT GGAGCGGTCT GATGCTGGCC TGTACTGGTG CCAGGTGAAG 240 GATGGGGAGG AAACCAAGAT CTCTCAGTCA GTATGGCTCA CTGTCGAAGG TGTGCCATTC 300 TTCACAGTGG AACCAAAAGA TCTGGCGGTG CCACCCAATG CCCCTTTTCA GCTGTCTTGT 360 GAGGCTGTGG GTCCTCCAGA ACCCGTAACC ATTTACTGGT GGAGAGGACT CACTAAAGTT 420 GGGGGACCTG CTCCCTCTCC CTCTGTTTTA AATGTGACAG GAGTGACCCA GCGCACAGAG 480 TTTTCTTGTG AAGCCCGCAA CATAAAAGGC CTGGCCACTT CCCGACCAGC CATTGTTCGC 540 CTTCAAGCAC CGCCTGCAGC TCCTTTCAAC ACCACAGTAA CAACGATCTC CAGCTACAAC 600 GCTAGCGTGG CCTGGGTGCC AGGTGCTGAC GGCCTAGCTC TGCTGCATTC CTGTACTGTA 660 CAGGTGGCAC ACGCCCCAGG AGAATGGGAG GCCCTTGCTG TTGTGGTTCC TGTGCCACCT 720 TTTACCTGCC TGCTTCGGAA CTTGGCCCCT GCCACCAACT ACAGCCTTAG GGTGCGCTGT 780 GCCAATGCCT TGGGCCCTTC TCCCTACGGC GACTGGGTGC CCTTTCAGAC AAAGGGCCTA 840 GCGCCAGCCA GAGCTCCTCA GAATTTCCAT GCCATTCGTA CCGACTCAGG CCTTATCCTG 900 GAATGGGAAG AAGTGATTCC TGAGGACCCT GGGGAAGGCC CCCTAGGACC TTATAAGCTG 960 TCCTGGGTCC AAGAAAATGG AACCCAGGAT GAGCTGATGG TGGAAGGGAC CAGGGCCAAT 1020 CTGACCGACT GGGATCCCCA GAAGGACCTG ATTTTGCGTG TGTGTGCCTC CAATGCAATT 1080 GGTGATGGGC CCTGGAGTCA GCCACTGGTG GTGTCTTCTC ATGACCATGC AGGGAGGCAG 1140 GGCCCTCCCC ACAGCCGCAC ATCCTGGGTG CCTGTGGTCC TGGGCGTGCT CACCGCCCTG 1200 ATCACAGCTG CTGCCTTGGC CCTCATCCTG CTTCGGAAGA GACGCAAGGA GACGCGTTTC 1260 GGGCAAGCCT TTGACAGTGT CATGGCCCGA GGGGAGCCAG CTGTACACTT CCGGGCAGCC 1320 CGATCTTTCA ATCGAGAAAG GCCTGAACGC ATTGAGGCCA CATTGGATAG CCTGGGCATC 1380 AGCGATGAAT TGAAGGAAAA GCTGGAGGAT GTCCTCATTC CAGAGCAGCA GTTCACCCTC 1440 GGTCGGATGT TGGGCAAAGG AGAGTTTGGA TCAGTGCGGG AAGCCCAGCT AAAGCAGGAA 1500 GATGGCTCCT TCGTGAAAGT GGCAGTGAAG ATGCTGAAAG CTGACATCAT TGCCTCAAGC 1560 GACATAGAAG AGTTCCTCCG GGAAGCAGCT TGCATGAAGG AGTTTGACCA TCCACACGTG 1620 GCCAAGCTTG TTGGGGTGAG CCTCCGGAGC AGGGCTAAAG GTCGTCTCCC CATTCCCATG 1680 GTCATCCTGC CCTTCATGAA ACATGGAGAC TTGCACGCCT TTCTGCTCGC CTCCCGAATC 1740 GGGGAGAACC CTTTTAACCT GCCCCTGCAG ACCCTGGTCC GGTTCATGGT GGACATTGCC 1800 TGTGGCATGG AGTACCTGAG CTCCCGGAAC TTCATCCACC GAGACCTAGC AGCTCGGAAT 1860 TGCATGCTGG CCGAGGACAT GACAGTGTGT GTGGCTGATT TTGGACTCTC TCGGAAAATC 1920 TATAGCGGGG ACTATTATCG TCAGGGCTGT GCCTCCAAAT TGCCCGTCAA GTGGCTGGCC 1980 CTGGAGAGCT TGGCTGACAA CTTGTATACT GTACACAGTG ATGTGTGGGC CTTCGGGGTG 2040 ACCATGTGGG AGATCATGAC TCGTGGGCAG ACGCCATATG CTGGCATTGA AAATGCTGAG 2100 ATTTACAACT ACCTCATCGG CGGGAACCGC CTGAAGCAGC CTCCGGAGTG CATGGAGGAA 2160 GTGTATGATC TCATGTACCA GTGCTGGAGC GCCGACCCCA AGCAGCGCCC AAGCTTCACG 2220 TGTCTGCGAA TGGAACTGGA GAACATTCTG GGCCACCTGT CTGTGCTGTC CACCAGCCAG 2280 GACCCCTTGT ACATCAACAT TGAGAGAGCT GAGCAGCCTA CTGAGAGTGG CAGCCCTGAG 2340 CTGCACTGTG GAGAGCGATC CAGCAGCGAG GCAGGGGACG GCAGTGGCGT GGGGGCAGTA 2400 GGTGGCATCC CCAGTGACTC TCGGTACATC TTCAGCCCCG GAGGGCTATC CGAGTCACCA 2460 GGGCAGCTGG AGCAGCAGCC AGAAAGCCCC CTCAATGAGA ACCAGAGGCT GTTGTTGCTG 2520 CAGCAAGGGC TACTGCCTCA CAGTAGCTGT 2550 4364 BASE PAIRS NUCLEIC ACID SINGLE LINEAR cDNA 9 CATTAGATCT TTACATGAAA GTAAAATTTA TAAGATTTCT AGAAAGTCAA AAGATGATAA 60 CTATTTCTTA GGATACTAAA AGCACTCACA TTATAGAAAA AAAATCAGTT AACTATACTC 120 CACAAACATT AAAGGCTCCC TATAAAAAAA CATTTTTAAT AGGCAAGCCA CAGAAAGGGC 180 AAATATTAAT AGTTTGCAAT ACATATGTAT GAAAAGGAAT TGAATCTAGA ATATTTAACA 240 AAGCTTTACA ACTCAAAAAA TACAAAGAAA ATATTTTTCT TCCAATTGGC AAATTACTTA 300 AACAGAACCT TCACAAAAGA AGATAAGAAT GTTTAATAAA CATTTGAAGC CATAATAATG 360 ACATCATTAG CCATGATGGA AATGCAAATT TAAGTACCAC TTCACATCCA CAAGAAAAAG 420 ATAAAAATAA AAGGACTGAG CTCACCAAAC ATTGGTGAGG ATGTGGTAAT ACTGAAATTC 480 TTGTACCGTG CTCCTGAGGG TATAACATAT TACAGGATTT TTTTGAAAAC TAGTGGTTCC 540 TTATAAACTT AATGCCCTGG CAACCTCACA CCTATTTACT TAAGAATGAA AGGGCCCCGC 600 CCTCCTCCCT CCTCGCTCGC GGGCCGGGCC CGGCATGGTG CGGCGTCGCC GCCGATGGCG 660 CTGAGGCGGA GCATGGGGCG GCCGGGGCTC CCGCCGCTGC CGCTGCCGCC GCCACCGCGG 720 CTCGGGCTGC TGCTGGCGGA GTCCGCCGCC GCAGGTCTGA AGCTCATGGG AGCCCCGGTG 780 AAGCTGACAG TGTCTCAGGG GCAGCCGGTG AAGCTCAACT GCAGTGTGGA GGGGATGGAG 840 GAGCCTGACA TCCAGTGGGT GAAGGATGGG GCTGTGGTCC AGAACTTGGA CCAGTTGTAC 900 ATCCCAGTCA GCGAGCAGCA CTGGATCGGC TTCCTCAGCC TGAAGTCAGT GGAGCGCTCT 960 GACGCCGGCC GGTACTGGTG CCAGGTGGAG GATGGGGGTG AAACCGAGAT CTCCCAGCCA 1020 GTGTGGCTCA CGGTAGAAGG TGTGCCATTT TTCACAGTGG AGCCAAAAGA TCTGGCAGTG 1080 CCACCCAATG CCCCTTTCCA ACTGTCTTGT GAGGCTGTGG GTCCCCCTGA ACCTGTTACC 1140 ATTGTCTGGT GGAGAGGAAC TACGAAGATC GGGGGACCCG CTCCCTCTCC ATCTGTTTTA 1200 AATGTAACAG GGGTGACCCA GAGCACCATG TTTTCCTGTG AAGCTCACAA CCTAAAAGGC 1260 CTGGCCTCTT CTCGCACAGC CACTGTTCAC CTTCAAGCAC TGCCTGCAGC CCCCTTCAAC 1320 ATCACCGTGA CAAAGCTTTC CAGCAGCAAC GCTAGTGTGG CCTGGATGCC AGGTGCTGAT 1380 GGCCGAGCTC TGCTACAGTC CTGTACAGTT CAGGTGACAC AGGCCCCAGG AGGCTGGGAA 1440 GTCCTGGCTG TTGTGGTCCC TGTGCCCCCC TTTACCTGCC TGCTCCGGGA CCTGGTGCCT 1500 GCCACCAACT ACAGCCTCAG GGTGCGCTGT GCCAATGCCT TGGGGCCCTC TCCCTATGCT 1560 GACTGGGTGC CCTTTCAGAC CAAGGGTCTA GCCCCAGCCA GCGCTCCCCA AAACCTCCAT 1620 GCCATCCGCA CAGATTCAGG CCTCATCTTG GAGTGGGAAG AAGTGATCCC CGAGGCCCCT 1680 TTGGAAGGCC CCCTGGGACC CTACAAACTG TCCTGGGTTC AAGACAATGG AACCCAGGAT 1740 GAGCTGACAG TGGAGGGGAC CAGGGCCAAT TTGACAGGCT GGGATCCCCA AAAGGACCTG 1800 ATCGTACGTG TGTGCGTCTC CAATGCAGTT GGCTGTGGAC CCTGGAGTCA GCCACTGGTG 1860 GTCTCTTCTC ATGACCGTGC AGGCCAGCAG GGCCCTCCTC ACAGCCGCAC ATCCTGGGTA 1920 CCTGTGGTCC TTGGTGTGCT AACGGCCCTG GTGACGGCTG CTGCCCTGGC CCTCATCCTG 1980 CTTCGAAAGA GACGGAAAGA GACGCGGTTT GGGCAAGCCT TTGACAGTGT CATGGCCCGG 2040 GGAGAGCCAG CCGTTCACTT CCGGGCAGCC CGGTCCTTCA ATCGAGAAAG GCCCGAGCGC 2100 ATCGAGGCCA CATTGGACAG CTTGGGCATC AGCGATGAAC TAAAGGAAAA ACTGGAGGAT 2160 GTGCTCATCC CAGAGCAGCA GTTCACCCTG GGCCGGATGT TGGGCAAAGG AGAGTTTGGT 2220 TCAGTGCGGG AGGCCCAGCT GAAGCAAGAG GATGGCTCCT TTGTGAAAGT GGCTGTGAAG 2280 ATGCTGAAAG CTGACATCAT TGCCTCAAGC GACATTGAAG AGTTCCTCAG GGAAGCAGCT 2340 TGCATGAAGG AGTTTGACCA TCCACACGTG GCCAAACTTG TTGGGGTAAG CCTCCGGAGC 2400 AGGGCTAAAG GCCGTCTCCC CATCCCCATG GTCATCTTGC CCTTCATGAA GCATGGGGAC 2460 CTGCATGCCT TCCTGCTCGC CTCCCGGATT GGGGAGAACC CCTTTAACCT ACCCCTCCAG 2520 ACCCTGATCC GGTTCATGGT GGACATTGCC TGCGGCATGG AGTACCTGAG CTCTCGGAAC 2580 TTCATCCACC GAGACCTGGC TGCTCGGAAT TGCATGCTGG CAGAGGACAT GACAGTGTGT 2640 GTGGCTGACT TCGGACTCTC CCGGAAGATC TACAGTGGGG ACTACTATCG TCAAGGCTGT 2700 GCCTCCAAAC TGCCTGTCAA GTGGCTGGCC CTGGAGAGCC TGGCCGACAA CCTGTATACT 2760 GTGCAGAGTG ACGTGTGGGC GTTCGGGGTG ACCATGTGGG AGATCATGAC ACGTGGGCAG 2820 ACGCCATATG CTGGCATCGA AAACGCTGAG ATTTACAACT ACCTCATTGG CGGGAACCGC 2880 CTGAAACAGC CTCCGGAGTG TATGGAGGAC GTGTATGATC TCATGTACCA GTGCTGGAGT 2940 GCTGACCCCA AGCAGCGCCC GAGCTTTACT TGTCTGCGAA TGGAACTGGA GAACATCTTG 3000 GGCCAGCTGT CTGTGCTATC TGCCAGCCAG GACCCCTTAT ACATCAACAT CGAGAGAGCT 3060 GAGGAGCCCA CTGTGGGAGG CAGCCTGGAG CTACCTGGCA GGGATCAGCC CTACAGTGGG 3120 GCTGGGGATG GCAGTGGCAT GGGGGCAGTG GGTGGCACTC CCAGTGACTG TCGGTACATA 3180 CTCACCCCCG GAGGGCTGGC TGAGCAGCCA GGGCAGGCAG AGCACCAGCC AGAGAGTCCC 3240 CTCAATGAGA CACAGAGGCT TTTGCTGCTG CAGCAAGGGC TACTGCCACA CAGTAGCTGT 3300 TAGCCCACAG GCAGAGGGCA TCGGGGCCAT TTGGCCGGCT CTGGTGGCCA CTGAGCTGGC 3360 TGACTAAGCC CCGTCTGACC CCAGCCCAGA CAGCAAGGTG TGGAGGCTCC TGTGGTAGTC 3420 CTCCCAAGCT GTGCTGGGAA GCCCGGACTG ACCAAATCAC CCAATCCCAG TTCTTCCTGC 3480 AACCACTCTG TGGCCAGCCT GGCATCAGTT TAGGCCTTGG CTTGATGGAA GTGGGCCAGT 3540 CCTGGTTGTC TGAACCCAGG CAGCTGGCAG GAGTGGGGTG GTTATGTTTC CATGGTTACC 3600 ATGGGTGTGG ATGGCAGTGT GGGGAGGGCA GGTCCAGCTC TGTGGGCCCT ACCCTCCTGC 3660 TGAGCTGCCC CTGCTGCTTA AGTGCATGCA TTGAGCTGCC TCCAGCCTGG TGGCCCAGCT 3720 ATTACCACAC TTGGGGTTTA AATATCCAGG TGTGCCCCTC CAAGTCAGAA AGAGATGTCC 3780 TTGTAATATT CCCTTTTAGG TGAGGGTTGG TAAGGGGTTG GTATCTCAGG TCTGAATCTT 3840 CACCATCTTT CTGATTCCGC ACCCTGCCTA CGCCAGGAGA AGTTGAGGGG AGCATGCTTC 3900 CCTGCAGCTG ACCGGGTCAC ACAAAGGCAT GCTGGAGTAC CCAGCCTATC AGGTGCCCCT 3960 CTTCCAAAGG CAGCGTGCCG AGCCAGCAAG AGGAAGGGGT GCTGTGAGGC TTGCCCAGGA 4020 GCAAGTGAGG CCGGAGAGGA GTTCAGGAAC CCTTCTCCAT ACCCACAATC TGAGCACGCT 4080 ACCAAATCTC AAAATATCCT AAGACTAACA AAGGCAGCTG TGTCTGAGCC CAACCCTTCT 4140 AAACGGTGAC CTTTAGTGCC AACTTCCCCT CTAACTGGAC AGCCTCTTCT GTCCCAAGTC 4200 TCCAGAGAGA AATCAGGCCT GATGAGGGGG AATTCCTGGA ACCTGGACCC CAGCCTTGGT 4260 GGGGGAGCCT CTGGAATGCA TGGGGCGGGT CCTAGCTGTT AGGGACATTT CCAAGCTGTT 4320 AGTTGCTGTT TAAAATAGAA ATAAAATTGA AGACTAAAGA CCTA 4364 2550 BASE PAIRS NUCLEIC ACID SINGLE LINEAR cDNA 10 GCAGGTCTGA AGCTCATGGG AGCCCCGGTG AAGCTGACAG TGTCTCAGGG GCAGCCGGTG 60 AAGCTCAACT GCAGTGTGGA GGGGATGGAG GAGCCTGACA TCCAGTGGGT GAAGGATGGG 120 GCTGTGGTCC AGAACTTGGA CCAGTTGTAC ATCCCAGTCA GCGAGCAGCA CTGGATCGGC 180 TTCCTCAGCC TGAAGTCAGT GGAGCGCTCT GACGCCGGCC GGTACTGGTG CCAGGTGGAG 240 GATGGGGGTG AAACCGAGAT CTCCCAGCCA GTGTGGCTCA CGGTAGAAGG TGTGCCATTT 300 TTCACAGTGG AGCCAAAAGA TCTGGCAGTG CCACCCAATG CCCCTTTCCA ACTGTCTTGT 360 GAGGCTGTGG GTCCCCCTGA ACCTGTTACC ATTGTCTGGT GGAGAGGAAC TACGAAGATC 420 GGGGGACCCG CTCCCTCTCC ATCTGTTTTA AATGTAACAG GGGTGACCCA GAGCACCATG 480 TTTTCCTGTG AAGCTCACAA CCTAAAAGGC CTGGCCTCTT CTCGCACAGC CACTGTTCAC 540 CTTCAAGCAC TGCCTGCAGC CCCCTTCAAC ATCACCGTGA CAAAGCTTTC CAGCAGCAAC 600 GCTAGTGTGG CCTGGATGCC AGGTGCTGAT GGCCGAGCTC TGCTACAGTC CTGTACAGTT 660 CAGGTGACAC AGGCCCCAGG AGGCTGGGAA GTCCTGGCTG TTGTGGTCCC TGTGCCCCCC 720 TTTACCTGCC TGCTCCGGGA CCTGGTGCCT GCCACCAACT ACAGCCTCAG GGTGCGCTGT 780 GCCAATGCCT TGGGGCCCTC TCCCTATGCT GACTGGGTGC CCTTTCAGAC CAAGGGTCTA 840 GCCCCAGCCA GCGCTCCCCA AAACCTCCAT GCCATCCGCA CAGATTCAGG CCTCATCTTG 900 GAGTGGGAAG AAGTGATCCC CGAGGCCCCT TTGGAAGGCC CCCTGGGACC CTACAAACTG 960 TCCTGGGTTC AAGACAATGG AACCCAGGAT GAGCTGACAG TGGAGGGGAC CAGGGCCAAT 1020 TTGACAGGCT GGGATCCCCA AAAGGACCTG ATCGTACGTG TGTGCGTCTC CAATGCAGTT 1080 GGCTGTGGAC CCTGGAGTCA GCCACTGGTG GTCTCTTCTC ATGACCGTGC AGGCCAGCAG 1140 GGCCCTCCTC ACAGCCGCAC ATCCTGGGTA CCTGTGGTCC TTGGTGTGCT AACGGCCCTG 1200 GTGACGGCTG CTGCCCTGGC CCTCATCCTG CTTCGAAAGA GACGGAAAGA GACGCGGTTT 1260 GGGCAAGCCT TTGACAGTGT CATGGCCCGG GGAGAGCCAG CCGTTCACTT CCGGGCAGCC 1320 CGGTCCTTCA ATCGAGAAAG GCCCGAGCGC ATCGAGGCCA CATTGGACAG CTTGGGCATC 1380 AGCGATGAAC TAAAGGAAAA ACTGGAGGAT GTGCTCATCC CAGAGCAGCA GTTCACCCTG 1440 GGCCGGATGT TGGGCAAAGG AGAGTTTGGT TCAGTGCGGG AGGCCCAGCT GAAGCAAGAG 1500 GATGGCTCCT TTGTGAAAGT GGCTGTGAAG ATGCTGAAAG CTGACATCAT TGCCTCAAGC 1560 GACATTGAAG AGTTCCTCAG GGAAGCAGCT TGCATGAAGG AGTTTGACCA TCCACACGTG 1620 GCCAAACTTG TTGGGGTAAG CCTCCGGAGC AGGGCTAAAG GCCGTCTCCC CATCCCCATG 1680 GTCATCTTGC CCTTCATGAA GCATGGGGAC CTGCATGCCT TCCTGCTCGC CTCCCGGATT 1740 GGGGAGAACC CCTTTAACCT ACCCCTCCAG ACCCTGATCC GGTTCATGGT GGACATTGCC 1800 TGCGGCATGG AGTACCTGAG CTCTCGGAAC TTCATCCACC GAGACCTGGC TGCTCGGAAT 1860 TGCATGCTGG CAGAGGACAT GACAGTGTGT GTGGCTGACT TCGGACTCTC CCGGAAGATC 1920 TACAGTGGGG ACTACTATCG TCAAGGCTGT GCCTCCAAAC TGCCTGTCAA GTGGCTGGCC 1980 CTGGAGAGCC TGGCCGACAA CCTGTATACT GTGCAGAGTG ACGTGTGGGC GTTCGGGGTG 2040 ACCATGTGGG AGATCATGAC ACGTGGGCAG ACGCCATATG CTGGCATCGA AAACGCTGAG 2100 ATTTACAACT ACCTCATTGG CGGGAACCGC CTGAAACAGC CTCCGGAGTG TATGGAGGAC 2160 GTGTATGATC TCATGTACCA GTGCTGGAGT GCTGACCCCA AGCAGCGCCC GAGCTTTACT 2220 TGTCTGCGAA TGGAACTGGA GAACATCTTG GGCCAGCTGT CTGTGCTATC TGCCAGCCAG 2280 GACCCCTTAT ACATCAACAT CGAGAGAGCT GAGGAGCCCA CTGTGGGAGG CAGCCTGGAG 2340 CTACCTGGCA GGGATCAGCC CTACAGTGGG GCTGGGGATG GCAGTGGCAT GGGGGCAGTG 2400 GGTGGCACTC CCAGTGACTG TCGGTACATA CTCACCCCCG GAGGGCTGGC TGAGCAGCCA 2460 GGGCAGGCAG AGCACCAGCC AGAGAGTCCC CTCAATGAGA CACAGAGGCT TTTGCTGCTG 2520 CAGCAAGGGC TACTGCCACA CAGTAGCTGT 2550 1158 BASE PAIRS NUCLEIC ACID SINGLE LINEAR cDNA 11 GCAGGCCTGA AGCTCATGGG CGCCCCAGTG AAGATGACCG TGTCTCAGGG GCAGCCAGTG 60 AAGCTCAACT GCAGCGTGGA GGGGATGGAG GACCCTGACA TCCACTGGAT GAAGGATGGC 120 ACCGTGGTCC AGAATGCAAG CCAGGTGTCC ATCTCCATCA GCGAGCACAG CTGGATTGGC 180 TTACTCAGCC TAAAGTCAGT GGAGCGGTCT GATGCTGGCC TGTACTGGTG CCAGGTGAAG 240 GATGGGGAGG AAACCAAGAT CTCTCAGTCA GTATGGCTCA CTGTCGAAGG TGTGCCATTC 300 TTCACAGTGG AACCAAAAGA TCTGGCGGTG CCACCCAATG CCCCTTTTCA GCTGTCTTGT 360 GAGGCTGTGG GTCCTCCAGA ACCCGTAACC ATTTACTGGT GGAGAGGACT CACTAAAGTT 420 GGGGGACCTG CTCCCTCTCC CTCTGTTTTA AATGTGACAG GAGTGACCCA GCGCACAGAG 480 TTTTCTTGTG AAGCCCGCAA CATAAAAGGC CTGGCCACTT CCCGACCAGC CATTGTTCGC 540 CTTCAAGCAC CGCCTGCAGC TCCTTTCAAC ACCACAGTAA CAACGATCTC CAGCTACAAC 600 GCTAGCGTGG CCTGGGTGCC AGGTGCTGAC GGCCTAGCTC TGCTGCATTC CTGTACTGTA 660 CAGGTGGCAC ACGCCCCAGG AGAATGGGAG GCCCTTGCTG TTGTGGTTCC TGTGCCACCT 720 TTTACCTGCC TGCTTCGGAA CTTGGCCCCT GCCACCAACT ACAGCCTTAG GGTGCGCTGT 780 GCCAATGCCT TGGGCCCTTC TCCCTACGGC GACTGGGTGC CCTTTCAGAC AAAGGGCCTA 840 GCGCCAGCCA GAGCTCCTCA GAATTTCCAT GCCATTCGTA CCGACTCAGG CCTTATCCTG 900 GAATGGGAAG AAGTGATTCC TGAGGACCCT GGGGAAGGCC CCCTAGGACC TTATAAGCTG 960 TCCTGGGTCC AAGAAAATGG AACCCAGGAT GAGCTGATGG TGGAAGGGAC CAGGGCCAAT 1020 CTGACCGACT GGGATCCCCA GAAGGACCTG ATTTTGCGTG TGTGTGCCTC CAATGCAATT 1080 GGTGATGGGC CCTGGAGTCA GCCACTGGTG GTGTCTTCTC ATGACCATGC AGGGAGGCAG 1140 GGCCCTCCCC ACAGCCGC 1158 1158 BASE PAIRS NUCLEIC ACID SINGLE LINEAR cDNA 12 GCAGGTCTGA AGCTCATGGG AGCCCCGGTG AAGCTGACAG TGTCTCAGGG GCAGCCGGTG 60 AAGCTCAACT GCAGTGTGGA GGGGATGGAG GAGCCTGACA TCCAGTGGGT GAAGGATGGG 120 GCTGTGGTCC AGAACTTGGA CCAGTTGTAC ATCCCAGTCA GCGAGCAGCA CTGGATCGGC 180 TTCCTCAGCC TGAAGTCAGT GGAGCGCTCT GACGCCGGCC GGTACTGGTG CCAGGTGGAG 240 GATGGGGGTG AAACCGAGAT CTCCCAGCCA GTGTGGCTCA CGGTAGAAGG TGTGCCATTT 300 TTCACAGTGG AGCCAAAAGA TCTGGCAGTG CCACCCAATG CCCCTTTCCA ACTGTCTTGT 360 GAGGCTGTGG GTCCCCCTGA ACCTGTTACC ATTGTCTGGT GGAGAGGAAC TACGAAGATC 420 GGGGGACCCG CTCCCTCTCC ATCTGTTTTA AATGTAACAG GGGTGACCCA GAGCACCATG 480 TTTTCCTGTG AAGCTCACAA CCTAAAAGGC CTGGCCTCTT CTCGCACAGC CACTGTTCAC 540 CTTCAAGCAC TGCCTGCAGC CCCCTTCAAC ATCACCGTGA CAAAGCTTTC CAGCAGCAAC 600 GCTAGTGTGG CCTGGATGCC AGGTGCTGAT GGCCGAGCTC TGCTACAGTC CTGTACAGTT 660 CAGGTGACAC AGGCCCCAGG AGGCTGGGAA GTCCTGGCTG TTGTGGTCCC TGTGCCCCCC 720 TTTACCTGCC TGCTCCGGGA CCTGGTGCCT GCCACCAACT ACAGCCTCAG GGTGCGCTGT 780 GCCAATGCCT TGGGGCCCTC TCCCTATGCT GACTGGGTGC CCTTTCAGAC CAAGGGTCTA 840 GCCCCAGCCA GCGCTCCCCA AAACCTCCAT GCCATCCGCA CAGATTCAGG CCTCATCTTG 900 GAGTGGGAAG AAGTGATCCC CGAGGCCCCT TTGGAAGGCC CCCTGGGACC CTACAAACTG 960 TCCTGGGTTC AAGACAATGG AACCCAGGAT GAGCTGACAG TGGAGGGGAC CAGGGCCAAT 1020 TTGACAGGCT GGGATCCCCA AAAGGACCTG ATCGTACGTG TGTGCGTCTC CAATGCAGTT 1080 GGCTGTGGAC CCTGGAGTCA GCCACTGGTG GTCTCTTCTC ATGACCGTGC AGGCCAGCAG 1140 GGCCCTCCTC ACAGCCGC 1158 19 BASE PAIRS NUCLEIC ACID SINGLE LINEAR cDNA 13 TGGATGGCAG TAAGGGAGG 19 20 BASE PAIRS NUCLEIC ACID SINGLE LINEAR cDNA 14 CTTAAGAGGG GCAAACCTGG 20 21 BASE PAIRS NUCLEIC ACID SINGLE LINEAR cDNA 15 GCTTAGAGGA GGTGAGCCAG A 21 18 BASE PAIRS NUCLEIC ACID SINGLE LINEAR cDNA 16 TGGGCAGTGC TGAGTTCC 18 

1. A mammalian receptor tyrosine kinase which is a developmental tyrosine kinase (Dtk) and which is expressed in multipotential haematopoietic cells, in embryonic stem cells, in brain tissue and in testis, but which is not expressed in mature lineage-restricted haematopoietic cells.
 2. A receptor tyrosine kinase according to claim 1 that is murine Dtk having the amino acid sequence of SEQ ID NO 1, or a functional equivalent thereof.
 3. A receptor tyrosine kinase according to claim 1 that is mature murine Dtk having the amino acid sequence of SEQ ID NO
 2. 4. A receptor tyrosine kinase according to claim 1 that is human Dtk having the amino acid sequence of SEQ ID NO 3, or a functional equivalent thereof.
 5. A receptor tyrosine kinase according to claim 1 that is mature human Dtk having the amino acid sequence of SEQ ID NO
 4. 6. An extracellular receptor domain of a receptor tyrosine kinase according to claim
 1. 7. An extracellular receptor domain which is the extracellular receptor domain of mature murine Dtk as defined in claim 3, or a functional equivalent thereof.
 8. An extracellular receptor domain of a receptor tyrosine kinase having the amino acid sequence of SEQ ID NO
 5. 9. An extracellular receptor domain which is the extracellular receptor domain of mature human Dtk as defined in claim 5, or a functional equivalent thereof.
 10. An extracellular receptor domain of a receptor tyrosine kinase having the amino acid sequence of SEQ ID NO
 6. 11. An extracellular receptor domain according to any one of claims 6 to 10 which is bound or attached to a support.
 12. A soluble receptor comprising the extracellular receptor domain of a receptor tyrosine kinase according to any one of claims 1 to 5 lacking the transmembrane region and catalytic domain of said receptor tyrosine kinase.
 13. A nucleic acid molecule encoding a receptor tyrosine kinase as defined in claim
 1. 14. A nucleic acid molecule encoding murine Dtk or a functional equivalent thereof as defined in claim
 2. 15. A nucleic acid molecule according to claim 14 which is DNA.
 16. A DNA molecule according to claim 15 having the nucleotide sequence of SEQ ID NO
 7. 17. A nucleic acid molecule encoding mature murine Dtk as defined in claim
 3. 18. A nucleic acid molecule according to claim 17 which is DNA.
 19. A DNA molecule according to claim 18 having the nucleotide sequence of SEQ ID NO
 8. 20. A nucleic acid molecule encoding human Dtk or a functional equivalent thereof as defined in claim
 4. 21. A nucleic acid molecule according to claim 20 which is DNA.
 22. A DNA molecule according to claim 21 having the nucleotide sequence of SEQ ID NO
 9. 23. A nucleic acid molecule encoding mature human Dtk as defined in claim
 5. 24. A nucleic acid molecule according to claim 23 which is DNA.
 25. A DNA molecule according to claim 24 having the nucleotide sequence of SEQ ID NO
 10. 26. A nucleic acid molecule encoding an extracellular receptor domain as defined in claim
 6. 27. A nucleic acid molecule encoding the extracellular receptor domain of murine Dtk or a functional equivalent thereof as defined in claim
 7. 28. A nucleic acid molecule according to claim 27 which is DNA.
 29. A DNA molecule according to claim 28 having the nucleotide sequence of SEQ ID NO
 11. 30. A nucleic acid molecule encoding the extracellular receptor domain of human Dtk or a functional equivalent thereof as defined in claim
 9. 31. A nucleic acid molecule according to claim 30 which is DNA.
 32. A DNA molecule according to claim 31 having the nucleotide sequence of SEQ ID NO
 12. 33. A vector including a DNA molecule a s defined in claim
 13. 34. A vector including a DNA molecule as defined in any one of claims 15, 16, 18 and
 19. 35. A vector including a DNA molecule as defined in any one of claims 21, 22, 24 and
 25. 36. A vector including a DNA molecule as defined in claim 28 or claim
 29. 37. A vector including a DNA molecule as defined in claim 31 or claim
 32. 38. A method of producing a receptor tyrosine kinase comprising the steps of: (a) culturing a host cell which has been transformed or transfected with a vector as claimed in any one of claims 33-35 to express the encoded receptor tyrosine kinase; and (b) recovering the expressed receptor tyrosine kinase.
 39. A method of producing an extracellular receptor domain of a receptor tyrosine kinase comprising the steps of: (a) culturing a host cell which has been transformed or transfected with a vector as claimed in claim 36 or claim 37 to express the encoded extracellular receptor domain; and (b) recovering the expressed extracellular receptor domain.
 40. A recombinant receptor tyrosine kinase which is the product of a method as defined in claim
 38. 41. A recombinant extracellular receptor domain which is the product of a method as defined in claim
 39. 42. A ligand that binds to a receptor tyrosine kinase as defined in claim
 1. 43. A ligand that binds to a receptor tyrosine kinase as defined in claim
 2. 44. A ligand that binds to a receptor tyrosine kinase as defined in claim
 3. 45. A ligand that binds to a receptor tyrosine kinase as defined in claim
 4. 46. A ligand that binds to a receptor tyrosine kinase as defined in claim
 5. 47. A ligand that binds to an extracellular receptor domain of a receptor tyrosine kinase as defined in claim
 6. 48. A ligand that binds to an extracellular receptor domain of a receptor tyrosine kinase as defined in claim
 7. 49. A ligand that binds to an extracellular receptor domain of a receptor tyrosine kinase as defined in claim
 8. 50. A ligand that binds to an extracellular receptor domain of a receptor tyrosine kinase as defined in claim
 9. 51. A ligand that binds to an extracellular receptor domain of a receptor tyrosine kinase as claimed in claim
 10. 52. A ligand that binds to an extracellular receptor domain of a receptor tyrosine kinase as claimed in claim
 11. 53. A ligand that binds to a soluble receptor as defined in claim
 12. 54. A ligand that binds to a receptor tyrosine kinase as claimed in claim
 40. 55. A ligand that binds to an extracellular receptor domain as claimed in claim
 41. 56. A ligand according to any one of claims 42-55 wherein the ligand stimulates the proliferation, differentiation and/or survival of cells which express a receptor tyrosine kinase according to claim
 1. 57. A ligand according to any one of claims 42-55 wherein the ligand is antagonistic and at least partially blocks or inhibits the function of a receptor tyrosine kinase according to claim 1 through binding to said receptor.
 58. A method of stimulating the proliferation, differentiation and/or survival of a cell expressing a receptor tyrosine kinase according to claim 1 comprising contacting the cell with a ligand according to claim
 56. 59. A method according to claim 58 wherein the stimulation occurs in vivo.
 60. A method according to claim 58 wherein the stimulation occurs a vivo.
 61. A method of inhibiting the function of a receptor tyrosine kinase according to claim 1 comprising contacting the receptor with a ligand according to claim
 57. 62. A method according to claim 61 wherein the inhibition occurs in vivo.
 63. A method according to claim 61 wherein the inhibition occurs ex vivo.
 64. A method of treating a disease, syndrome or condition caused or mediated by an excess of a ligand as claimed in claim 56 comprising the step of contacting said excess of said ligand with an effective amount of a receptor tyrosine kinase according to any one of claims 1-5 and 40, an extracellular receptor domain according to any one of claims 6-11 and 41 or a soluble receptor according to claim
 12. 65. A method of treating a disease, syndrome or condition caused or mediated by an excess of a ligand as defined in claim 57 comprising the step of contacting said excess of said ligand with an effective amount of a receptor tyrosine kinase according to any one of claims 1-5 and 40, an extracellular receptor domain according to any one of claims 6-11 and 41 or a soluble receptor according to claim
 12. 66. A method of extracting a ligand as defined in claim 56 or claim 57 from a medium which may contain said ligand comprising the step of contacting said medium with a receptor tyrosine kinase according to any one of claims 1-5 and 40, an extracellular receptor domain according to any one of claims 6-11 and 41 or a soluble receptor according to claim
 12. 67. A method of isolating ligand(s) as defined in claim 56 or claim 57 from a medium which may contain said ligand(s), comprising the steps of: (a) contacting said medium with an effective amount of a receptor tyrosine kinase according to any one of claims 1-5 and 40, an extracellular domain according to any one of claims 6-11 and 41 or a soluble receptor according to claim 12; (b) detecting which ligand(s) bind to said tyrosine kinase receptor, extracellular receptor domain or soluble receptor; and (c) isolating such bound ligand(s).
 68. A ligand which is isolated by a method according to claim
 67. 